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Worrying changes in Hungary 


The European country’s autocratic government has made a disturbing grab at the nation’s 


scientific institutions. 


cliché has been true there in recent years, as the ultranationalist 

government of Viktor Orban has tightened its grip. Although 
European Union politicians have watched Hungary's increasingly anti- 
democratic tendencies with mounting concern, researchers have seen 
the nation’s research base begin to flourish, with new internationally 
competitive laboratories. 

This juxtaposition has been achieved because, until now, Hungary 
has left science in the hands of its own experts. And for the most part, 
they have done a splendid job. That situation has now changed. The 
authoritarian government is snatching away scientific autonomy — 
and this should provoke alarm. 

The storm has been gathering since an April election brought a land- 
slide victory for Orban's Fidezs party. In its path is the proud Hungarian 
Academy of Sciences, which has stood independent of politics for more 
than two decades, since the collapse of communism in the region. The 
academy has done sterling work, including creating major research 
grants that allowed many researchers who had been working abroad 
to return to Hungary and establish independent labs. Yet the govern- 
ment’s budget proposal for next year, announced earlier this month, 
would transfer the majority of the academy’s financing into the newly 
created Ministry for Innovation and Technology. 

And last week, Orban dismissed Jézsef Palinkas from the leadership 
of the National Research, Development and Innovation Office, a post 
he has held for three years. Since 2015, Palinkas, a physicist and former 
academy president, has created from scratch a broad portfolio of fund- 
ing programmes ranging from basic science to near-industry research. 
The scheme was a model for how to build a science base founded on 
excellence, and it triggered a welcome reversal to Hungary's previous 
brain drain (see Nature 551, 425-426; 2017). 

According to innovation minister Laszlé Palkovics, the changes are 
to unify innovation and science policy, and to eliminate fragmentation 
of research budgets. On the face of it, that’s reasonable. Palkovics prom- 
ises that the academy money will be filtered back to its various research 
institutions. And any incoming government in a democratic country, 
of course, has the right and mandate to replace key members of staff. 

Yet many researchers in Hungary tell Nature they are worried acad- 
emy money might be returned with strings attached — maybe instruc- 
tions that it should be spent to serve the economy more directly, or that 
historians should glorify their country’s past. Trust is at a low ebb. The 
government’s actions in other areas are becoming ever more extreme. 
On the day of Palinkas'’s dismissal, for example, the parliament approved 
a law that makes helping refugees to apply for asylum in Hungary a 
crime punishable by up to 90 days’ imprisonment. It also approved con- 
stitutional changes that require all state institutions to protect Hungarian 
cultural and Christian values, and that make homelessness illegal. 

Orban has never felt comfortable with what he sees as academia’s 
international and elitist air. A particular bugbear for him has been 
the Central European University (CEU), which was founded in 1991 


[ee writers like to call Hungary a land of contrasts. But the 


by Hungarian-born philanthropist George Soros, and is registered 
in New York state but located in Budapest. A law rushed through in 
April last year required international universities to operate as higher- 
education institutes in their country of origin as well as in Hungary. 
That law affected only the CEU, whose agreement to remain in Hun- 
gary expires at the end of this year. The change attracted an impres- 
sive 70,000 protesters to the streets of Budapest, and the CEU quickly 
r arranged higher-educational activities in the 
The United States to be compliant with the law. 


authoritarian But the government has still not signed off 
government on a new agreement for the university to 
is snatching stay in the country, drafted last September. 
away scientific Negotiations are continuing, but the CEU 


has organized an alternative home for itself 
in Austria, and a transfer there seems increas- 
ingly likely as deadlines for recruiting next year’s students approach. 
That would deprive Hungary of a valuable intellectual hub, and would 
mark another significant step backwards for the country. 

The message to Hungary should be clear: ensure that the gov- 
ernment’s new management and methods continue to uphold the 
principles of meritocratic funding. And maintain the possibilities of 
long-term funding for excellent basic research, to help ensure that a 
strong scientific community can continue to feed the government's 
laudable innovation ambitions. Meanwhile, the 2019 budget, with its 
plan to take control of the Hungarian Academy of Sciences’ funds, 
is scheduled to be approved by mid-July. There is still time for the 
government to reverse its course. It should do so. m 


autonomy.” 


Local science 


Researchers in five Asian economies are 
working to address communities’ needs. 


is to work to improve society. But which society? The needs 

and circumstances of people, communities and regions across 
the world are very different — from energy use and disease threats to 
natural-resource availability and pollution. 

Ina special issue this week, Nature explores how some of these local 
needs are being addressed across five strong science centres in East 
Asia: Hong Kong, Malaysia, Singapore, South Korea and Taiwan. Over 
the past few decades, each member of this diverse group has evolved its 
own model of how to pursue research successfully. Impressively, some 
of their key achievements are those in which they have matched the 
science agenda to explicit and unique local requirements. 


() ne of the most commonly stated goals of science and scientists 
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That is a good model for others to follow, especially given that 
large numbers of people around the world are not well enough 
served by the agendas and interests that drive much of modern 
science. Nature has argued before that more scientists and funders 
should reach out to identify and tackle direct societal challenges in 
this way (Nature 542, 391; 2017). 

Each of the economies we highlight has a unique history that has 
shaped its research and development. Take Malaysia, a peninsula and 
constellation of islands sandwiched between Thailand and Indone- 
sia. In the 1970s, it started to shift its economic reliance on cheap 
products such as tin, rubber and cocoa to higher-value commodities 
such as natural gas and palm oil. It then used applied science to foster 
a booming electronics industry. Some of its major exports today 
include other products of applied research, including chemicals. Yet 
this success has come with relatively low investment in science and 
technology. 

More unusual still, almost half of researchers there are women (see 
page 500). Ina News Feature this week (see page 502), among others, 
we profile Malaysian chemical engineer Suzana Yusup, who leads a 
centre that makes fuels from biomass waste, such as used cooking 
oil, rubber-seed oil and discarded distillate from palm-oil refineries. 
Her career has focused on green technologies that can help the 
environment and society. 

Scientists in Malaysia, which has a predominantly Muslim 
population, are also developing Halal substitute ingredients for 
food, pharmaceuticals and cosmetics, reaping the benefits of a Halal 
economy that, in 2016, was worth US$2 trillion globally. 

Malaysia demonstrates how applied science can generate the 
economic benefits that can allow officials to invest in societal needs. 
And it’s not alone. Singapore, along with South Korea and Taiwan, 
has long focused on applied projects across electronics, physics 
and materials science. The success of these has boosted its gross 
domestic product (GDP). And the Singaporean government is now 
putting some of that money into national priorities — health care and 


biomedical sciences among them. There's a strong push to understand, 
detect and treat heart disease and cancers of the liver, stomach, breast 
and lung, which have a significant impact on Asian populations. 

By most measures, South Korea is an impressive performer in 
science, making it a giant in the region. It invests more than 4% of 
its GDP in science and technology — much of it applied — and has 
a high density of researchers per head of population. Its output of 

scholarly articles has skyrocketed in the past 


“Each of the two decades. 

economies has But South Korea is also choking under 
aunique history acloud of air pollution, and, as physicist 
thathas shaped Han Woong Yeom at Pohang University of 
itsresearchand Science and Technology writes in a Com- 


ment piece (see page 511), its science policy 
must be updated to address this and other 
national needs. That might already be happening. A 2015 analysis of 
development of the regional research and technology organization 
in Gyeonggi province suggested that policymakers had switched 
from a top-down approach to one that emphasizes the “detailed 
analysis of local industry needs” (S. Shin Reg. Stud. Reg. Sci. 2, 
424-431; 2015). 

Not all scientists in conventional research-powerhouse economies 
might welcome such direct targeting of local problems, just as some 
frown when politicians talk up the need for applied science. But there 
does not have to be a trade-off between work that is of international 
quality and work that has a direct local impact. Hong Kong, for 
example, has grown into a hub for researchers investigating emerging 
infectious diseases, such as the avian influenza strain H5N1 and severe 
acute respiratory syndrome (SARS), both of which originated in Asia. 
Those teams published papers in leading journals. But research there 
has also demonstrated that closing live-poultry markets for a day or 
two each month could dramatically reduce the spread of bird flu and 
cut the risk to people. That’s a win-win situation, the likes of which all 
societies should encourage — wherever they are. m 


development.” 


Valediction 


A reflection as the seventh editor-in-chief of 
Nature hands over to the eighth. 


tion’s editor-in-chief. The first was published on 14 December 
1995. A few personal thoughts seem in order. 

Nature’s editorial role since its foundation in 1869 has consistently 
been about support for outstanding science while also being a critically 
minded friend of the research community and its values. Fired by my 
own enthusiasms for astronomy and physics since childhood and as 
a researcher, and by this publication’s ever-broadening interests and 
international ethos, it has been my extraordinary good fortune and 
privilege to work with many researchers and colleagues to help Nature 
to continue and develop in its mission. 

As a journal, Nature has thrived by keeping abreast of some of 
the most inspired and inspiring research — insights into the human 
genome and the microbiome, developments in photovoltaics and the 
extraordinary flowering of exoplanet research are just some examples 
that have been a joy to see. The journal has also gratifyingly grown into 
areas that were well established elsewhere — organic chemistry and 
high-energy physics are two. And the totally unexpected has always felt 
best: Homo floresiensis (‘the Hobbit’) was perhaps my own favourite. 

On the magazine components, a look back at some 1995 issues shows 
how focused Nature then was on narrow rather than widely interesting 
policy news, how little commissioned comment there was relating to the 
research enterprise and its external relationships and how impenetrable 


r | Vhis issue of Nature is the last under my tenure as the publica- 
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some of the language was in our News and Views section. Ever since, it 
has always been my ambition and that of the editorial teams progres- 
sively to open up our pages to more lively and comprehensible fare. 

My regrets include wonderful papers that we failed to attract, and 
that we still have more to do in speeding up our handling of labyrin- 
thine complexities that can arise in retractions and formal critiques 
of our papers. There are initiatives under way towards being more 
attentive in our content to the needs and interests of under-represented 
groups in the population and in the research community, and being 
equivalently more diverse in the make-up of our editorial team. I wish 
Ihad pushed harder on all of these fronts. 

An editor-in-chief has a platform on which to champion readers’ 
needs and interests — and also under-attended causes. Mine have 
included the interests of social sciences, reproducibility, healthy 
research cultures and environments, the tracking of research’s soci- 
etal impacts, and mental-health research. Throughout, my goal has 
been, above all, to make the weekly issue — much of it now published 
continuously online — something that as many as possible of our very 
demanding audience eagerly look forward to. 

Whatever has been achieved, none of it would have been possible 
without great colleagues. Nature’s editorial staff over the past 22-plus 
years has included many inspiringly skilled and visionary individu- 
als. As a result, while there have been some acknowledged missteps, 
the time we have spent has been rich in fulfilment — at least for me, 
possibly for them too, and above all, I hope, for readers. 

As I move on toa new role as editor-in-chief of our publishing com- 
pany Springer Nature, I thank those many people inside and outside the 
research community who have helped to make Nature what it is. Above 
all, I offer the Nature team my profound thanks. I wish them and my 
successor Magdalena Skipper all the very best in their abundant future 
responsibilities and opportunities. Philip Campbell 
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BRIAN STAUFFER 


WORLD VIEW  jernisicos sen 


give one group a political advantage — is not part of anyones 

idea of democracy. Although it is difficult to define gerry- 
mandering precisely, the contorted shapes of electoral districts defy 
simple explanation and imbue a public perception ofa rigged system. 
And when, as in Pennsylvania in 2014, a party captures 72% of its US 
House of Representatives seats with only 55.5% of the statewide vote, 
suspicions are piqued. 

Thus this week’s decision by the Supreme Court, which all but 
squelched hopes for a manageable standard ahead of the 2020 redis- 
tricting cycle, is unwelcome news for those who anticipated that the 
court would take a forceful lead in curtailing partisan gerrymander- 
ing. However, even with such a standard for detecting gerrymanders, 
politicians have shown us that they are extremely savvy when it comes 
to circumventing legal constraints. 

An ounce of prevention is worth a pound of 
cure. I argue that the means of such prevention 
lies not with the courts but in technological 
advances, as long as we are mindful of Supreme 
Court Justice Anthony Kennedy’s admonish- 
ment in 2004 that, for partisan gerrymandering, 
“technology is both a threat and a promise”. 

In the United States, electoral districts are 
redrawn every ten years. In more than two-thirds 
of the states, partisan legislators control congres- 
sional redistricting. A proliferation of software 
that emerged about 30 years ago has facilitated 
the drawing of electoral maps that simultane- 
ously entrench power while meticulously adher- 
ing to legal districting practices. Worse, current redistricting software 
requires experts with political and legal savvy, who generally work in 
secret behind closed doors. Hence, the software has served only to 
advance the threat of technology in redistricting. 

We must now work to enable its promise. 

I develop statistical and computational models that intelligently 
extract information. My research uses the world’s fastest supercom- 
puters in the service of social progress. For redistricting, this means 
devising efficient algorithms that make quadrillions of calculations 
per second on highly sophisticated computing architectures to explore 
how best to ensure fairness in electoral maps. 

The task of redistricting is well suited for computational algorithms 
because the goals can be articulated clearly, performance metrics can 
be specified easily and the tasks are distinct and structured. Moreover, 
computational algorithms are able to present a wide array of possibili- 
ties that capture the interests of diverse societal groups. Perhaps most 
importantly, computers are impervious to the lure of power. 

Because our collective voice is composed of the individual voices of 
many distinct and diverse groups, political fairness is a complex phe- 
nomenon. It requires compromise and balancing competing interests so 
that members ofall groups (racial and ethnic minorities, labour unions, 


CGiveeeor — the manipulation of district boundaries to 


COMPUTERS ARE 


IMPERVIOUS 


TO THE 
LURE 
OF 
POWER. 


Algorithms can foster a 
more democratic society 


Counterbalancing the Supreme Court’s gerrymandering ruling is technology’s 
potential to prevent gerrymanders in the first place, says Wendy K. Tam Cho. 


all socio-economic levels and so forth) are represented. 

Citizens and interest groups can articulate what political fairness 
means to them, but they lack the legal and political expertise to translate 
their goals into actual electoral maps, so their voices are easily muted. 
This is where intelligent computational algorithms can play a part. They 
can search for possible maps that simultaneously adhere to legal thresh- 
olds (for example, compactness, representation of minority groups, per- 
centage of split municipal subdivisions) while fulfilling criteria from 
partisan groups and non-partisan ones, such as the League of Women 
Voters, and Common Cause, which promote competitive voting dis- 
tricts, and groups such as the American Civil Liberties Union, whose 
mission it is to protect the civil rights ofall Americans. Algorithms could 
amalgamate these wide and varied interests to identify electoral maps 
that are acceptable to a broad swathe of society. 

Technological innovation could supply miss- 
ing information that is highly significant for 
improving democratic society. Maps that encom- 
pass competing interests must be made central 
to redistricting discussions and deliberation by 
politicians and independent commissions. 

Of course, algorithms can themselves embody 
bias. Concerns include well-publicized issues 
around ‘predictive-policing’ programs (see 
Nature 558, 357-360; 2018) that aim to deter- 
mine who is at risk of reoffending; these can 
unfairly penalize African Americans. In the 
case of redistricting, however, the algorithms 
are not making decisions, but fostering more- 
inclusive conversations. The criteria are supplied 
by diverse groups with valid competing interests. These maps do not 
become law in secret, but set the stage for deliberative democracy. 
Humans are free to reject and modify them as they see fit. 

That is why my colleague, Yan Liu, and I have been developing 
PEAR (Parallel Evolutionary Algorithm for Redistricting), a compu- 
tational algorithm that integrates Supreme Court mandates and carries 
out intelligent analysis to identify legally viable maps that satisfy an 
array of specific goals. (PEAR is tailored for the United States, but the 
core ideas of exploring redistricting possibilities transfer easily to other 
locales.) Our hope is to move technological advances in the direction 
of supplying objective information that empowers the inclusion of 
diverse societal groups and enhances human deliberation. 

So far, technology for redistricting has led only to the exclusion 
and isolation of power. Moving forward, we must harness the power 
of technology to ensure democracy. The promise of technology is to 
augment human capabilities to engage in productive, inclusive and 
contemplative decision-making about how society is governed. m 


Wendy K. Tam Cho is a professor of political science, law, statistics 
and mathematics at the University of Illinois at Urbana-Champaign. 
e-mail: wendycho@illinois.edu 
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SEVEN DAYS 


FACILITIES 


Telescope boost 
Spain has joined the effort to 
build the world’s largest radio 
telescope, which will probe 
the early Universe. When 


complete, the Square Kilometre 


Array (SKA) will have about 
2,000 radio dishes in South 
Africa and up to 1 million 
in Australia. The project's 


€674-million (US$786-million) 


first phase was scaled back last 
year to cut costs; it will now 
consist of 194 dishes in South 
Africa and about 130,000 
antennas in Australia, and it is 
due to begin construction in 
2020. The project’s governing 
body, the SKA Organisation, 
has had ten members, 
including Sweden, the United 
Kingdom and the two host 
nations, since Germany left 
in 2014. Spain will pay an 
undisclosed membership 

fee to join, and it will enter 
into negotiations about its 


NATURE’S SALARY SURVEY 
If you are a working 
scientist, or you work in 

a science-based job, we 
would love to hear from 
you in Nature’s 2018 salary 
survey. Every two years, 
we ask you about your 

job: what you love, what 
you dislike, what you have 
done so far and what you 
would change. The survey 
takes around 20 minutes 
to complete, and we will 
report on the findings later 
this year. Your answers will 
help us to identify global 
trends in income, benefits 
and job satisfaction, as 
well as help you to make 
more-informed career 
decisions. Survey entrants 
will have the chance to enter 
a prize draw to win a £100 
or US$150 Amazon gift 
card. Take the survey by 

16 July at go.nature.com/ 
salarysurvey2018 


The news in brief 


Gorilla who knew sign language dies 


Koko, a gorilla that learnt an adapted version 

of American Sign Language, died in her 

sleep on 19 June. She was 46 years old. The 
announcement came from the California-based 
non-profit group The Gorilla Foundation, 
whose co-founder, animal psychologist Francine 
Patterson, started teaching Koko sign language 


contribution to phase one. The 
SKA hopes that recruiting new 
members will bring it the funds 
needed to build the full design 
of the first phase, says SKA 
director of communications 
William Garnier. 


POLICY 


Trump makeover 
US President Donald Trump 
released a sweeping plan 

on 21 June to reorganize — 
and shrink — the federal 
government. Oversight of 
fisheries would move from 
the National Oceanic and 
Atmospheric Administration, 
in the commerce dep artment, 
to the interior department. 
The plan also calls for the Food 
and Drug Administration to 
be renamed the Federal Drug 
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Administration, and for its 
responsibility of ensuring 
food safety to be transferred 
to the agriculture department. 
But the proposal, which 
would need to be approved 
by Congress, already faces 
stiff opposition from top 
Democrats and is unlikely 

to take effect. Trump's 
predecessor, Barack Obama, 
tried and failed to implement 
his own government 
reorganization plan in 2012. 


Embryo research 
New Zealand’s guidelines on 
the use of human embryos 

in research are unclear anda 
barrier to infertility studies, 
according to a survey of the 
country’s researchers published 
in The New Zealand Medical 
Journal. Although the use of 


in 1972 while studying the cognitive abilities of 
western lowland gorillas (Gorilla gorilla gorilla). 
Koko made headlines not only for her language 
skills — she reportedly used more than 1,000 
signs and understood around 2,000 words of 
spoken English — but also for adopting kittens, 
the first of which she named All Ball. 


human embryos for research 

is allowed under New Zealand 
law, guidelines developed in 
2005 refer to the use of only 
non-viable embryos, effectively 
banning researchers from 
using viable human embryos, 
say the survey’s authors. Most 
of the 20 human-embryo 
researchers surveyed felt that 
they were disadvantaged by the 
lack of specific guidance, with 
11 saying they had potential 
research projects that they 
couldn't take up under the 
current regulations. 


EU copyright laws 

A set of controversial changes 
to copyright rules in the 
European Union has passed a 
hurdle on the way to becoming 
law. The proposed legislation, 
which covers copyright of 


BETTMAN/GETTY 
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SOURCE: H. D'ALBIS, E. BOUBTANE & D. COULIBALY SCI. ADV. 4, EAAQ0883 (2018). 


digital material in the EU, 
would require social-media 
platforms to pay fees to 
rightholders for any content or 
snippets of content uploaded 
on these platforms. It would 
also require web services, 
including repositories for 
research articles, to actively 
prevent uploads of copyrighted 
material. The legislation 
exempts academic researchers 
from paying fees to perform 
data and text-mining — 
computer-based data searches 
of large amounts of texts — 

on sources they have legal 
access to. But that provision 
does not cover researchers 

at commercial companies, 
prompting fears that it might 
hamper public-private 
research collaborations in the 
EU. The European Parliament's 
legal-affairs committee voted 
in favour of the changes on 

20 June; they must now be 
approved by the Parliament 
and EU member states. 


US ocean policy 

US President Donald Trump 
has written climate change 
out of the country’s plan for 
managing its coastal waters 
and the Great Lakes. On 

19 June, Trump issued a 
national ocean policy that 
emphasizes the development 
of ocean industries, such as 
offshore oil and gas drilling. 
The document replaces a 
policy that former president 
Barack Obama released in 
2010, following the massive 


TREND WATCH 


Asylum seekers and migrants 
searching for safe havens and 


opportunities benefit their host 


nations’ economies within 5 years 


of arrival, suggests an analysis of 


30 years of data from 15 countries 


in Western Europe. Researchers 
modelled how economic 


indicators, such as gross domestic 


product per capita, changed 
following a spike in immigration. 
Their model suggested that 
within two years of an influx of 
migrants, unemployment rates 
drop significantly and economic 
health increases. 


Deepwater Horizon oil 

spill in the Gulf of Mexico 
(pictured). That plan came 
under sustained criticism 
from Republican politicians 
for its focus on environmental 
sustainability and stewardship, 
including several mentions of 
climate change. 


ENERGY 


Power initiatives 
The Chinese Academy of 
Sciences (CAS) has launched 
a large-scale clean-energy 
project. On 19 June, it 
announced that 20 of its 
institutes will join forces to 
develop dozens of renewable- 
energy and energy-efficient 
technologies before 2023. 
Together, they will aim to help 
cut pollution from coal-fired 
power stations by 40-50%, 
and replace 100 million tonnes 
of oil and gas usage. China 

is the world’s largest energy 
consumer; in 2017, it used 
608 million tonnes of oil. On 
15 June, state-owned nuclear 


power developer China 
National Nuclear Corporation 
announced plans to open a 
university in the east-coast 
city of Tianjin to train nuclear 
engineers who will help to 
ramp up nuclear production 
and export nuclear technology. 
China has an ambitious 

plan to have 58 gigawatts of 
nuclear generating capacity 

in operation by 2020, up from 
33.6 gigawatts at the end of 
2016. 


Asteroid plan 


The US government has 
reaffirmed its commitment to 
finding dangerous near-Earth 
asteroids and developing ways 
to deflect them if needed. A 
20 June federal report outlines 
how government agencies 
will cooperate to upgrade 
detection capabilities, such as 
by building new telescopes, 
and how they can collaborate 
on new computer models to 
assess impact hazards. The 


THE ECONOMICS OF MIGRATION 


A mathematical model based on 30 years of data from 15 Western 
European countries suggests that migrants and asylum seekers do not 
burden national economies and instead benefit them. 
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SEVEN DAYS | THIS WEEK | 


government has almost tripled 
its spending on planetary 
defence in the past few years, 
from about US$21 million in 
2013 to $60 million in 2017. 
Still, NASA is only about 
one-third of the way to its 
goal of tracking the roughly 
25,000 near-Earth asteroids 
that are at least 140 metres 
across — those big enough to 
cause serious regional damage 
if they were to hit the planet. 


Scientist sacked 


Hungary's reform-minded 
head of research has been 
dismissed by the country’s 
prime minister, its innovation 
ministry announced on 

20 June. Jézsef Palinkas, a 
physicist and former president 
of the Hungarian Academy of 
Sciences (HAS), was appointed 
head of the National Research, 
Development and Innovation 
Office in 2015, where he 

led an expansion in science 
funding. His dismissal comes 
one week after a government 
proposal that would hand the 
innovation ministry control of 
much of the HAS’ research- 
funding budget. See page 485 
for more. 


Marijuana drug 

For the first time, the US Food 
and Drug Administration 

has approved a treatment 
containing a component 
derived from marijuana. Ina 
25 June announcement, the 
agency approved Epidiolex, an 
antiepileptic drug made from 
cannabidiol. The chemical 

is found in marijuana but 

does not cause psychoactive 
effects. Epidiolex, made by GW 
Pharmaceuticals in Cambridge, 
UK, can now be used to treat 
two epilepsy disorders that 
manifest in childhood, known 
as Lennox-Gastaut syndrome 
and Dravet syndrome. In 
clinical trials, Epidiolex 
reduced the number of seizures 
in some people by more than 
50%. The drug is currently 
under review by the European 
Medicines Agency. 


> NATURE.COM 
For daily news updates see: 
www.nature.com/news 
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NEWSIN FOCUS 


US-China trade 
war raises cost of lab 
equipment and supplies p.494 


on mysterious dark asteroid 


Hayabusa-2 closes in 
Ryugu p.495 


to fight disease p.497 


Genetically modified 
bacteria tested for potential 


The rising 
stars of East Asian science 
show their worth p.499 


Ve 
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The Indian monsoon can bring damaging floods, but also has a crucial role in ensuring an adequate water supply for people and crops. 


ATMOSPHERIC SCIENCE 


Mysteries of Indian 
monsoon probed 


Research plane and ships aim to gather the most detailed data yet on rainfall variations. 


BY ALEXANDRA WITZE 


eavy rains and seven-metre-high 
H== pummelled the research vessel 
Thomas G. Thompson in the Bay of 
Bengal this month, routinely drenching the 
oceanographers on deck. But that was just fine 
with the scientists. Their entire plan involved 
getting as wet as possible, in order to directly 
measure what happens where the air and the 
sea meet in a summer storm. 
The team is part of a multinational group 
of researchers who are descending on the 


Indian Ocean this summer to study its 
seasonal monsoon. They intend to gather 
the most detailed observations yet on the wet 
and dry periods that alternate roughly every 
10-50 days during the monsoon season, which 
lasts from June to September. 

If modellers could better predict these 
varying patterns — called monsoon intra- 
seasonal oscillations, or MISOs — then 
officials could better prepare for the monsoon 
each year. That includes timing the planting of 
crops in concert with the rains, storing water 
behind dams for hydropower, and preparing 


for floods and other natural disasters. 

“One billion people on the Indian 
subcontinent depend on planning for water 
resources,’ says Harindra Fernando, a mechan- 
ical engineer at the University of Notre Dame in 
Indiana and one of the project's leaders. “When 
people are waiting for rain, it’s important to 
knowwhen you will get it and when you won't” 

MISOs represent the monsoons ‘active’ 
and ‘break’ periods, in which weeks of heavy 
rainfall give way to brilliant sunshine before 
starting all over again. The patterns of rainfall 
generally track northwards over the Bay 
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STORMY SKIES 


Each summer, the monsoon brings alternating periods of rain and clear skies to the Indian subcontinent. 
Researchers are trying to understand the factors that drive these wet and dry periods, to better predict and 


prepare for future monsoons. 
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> of Bengal, and sometimes veer towards 
the Indian subcontinent — where they can 
cause serious damage. In 2017, a powerful 
MISO brought torrential rain and landslides 
to Sri Lanka, killing more than 200 people. 
Weather and climate models have not been 
able to accurately predict MISOs'. Strong and 
frequent interactions between the atmosphere 
and the ocean seem to help get them started’, 
as warm ocean waters feed energy into the air 
above (see ‘Stormy skies’). A study published 
this year suggests that certain ocean processes, 
such as a type of wave that helps warm the top- 
most waters, could play a big part in kicking off 
many MISOs°. Having direct measurements 
from within a MISO will help modellers to 
pinpoint the exact conditions that drive them, 
says lead author Jason West, an atmospheric 
scientist at the University of Colorado Boulder. 
The Bay of Bengal project aims to measure 
the microphysics of energy flows between the 


ocean and the atmosphere once MISOs are 
under way. It builds on a long history of inter- 
national field campaigns to understand the 
intricacies of the Indian Ocean monsoon, so 
that neighbouring countries can better prepare 
for it*. Funders of the five-year study, which 
is known as MISO-BOB, include the Indian 
Ministry of Earth Sciences and the US Office 
of Naval Research, working with institutions 
such as Sri Lanka’s National Aquatic Resources 
Research and Development Agency (NARA). 
MISO-BOB’s aerial component started 
on 15 June from an air base in Colombo, 
Sri Lanka. Scientists have been flying aboard 
the US Air Force’s hurricane-hunter C-130 
plane, which carries equipment to measure 
the properties of clouds and the atmosphere. 
It is releasing instrument packages called drop- 
sondes that measure temperature, pressure and 
wind speed as they plummet towards the sea. 
The Thompson, meanwhile, left Chennai, 


India, on 4 June for the first of two research 
legs to gather data on ocean conditions. 
Scientists on board — mostly students and 
early-career researchers — have deployed a 
variety of instruments to measure temperature, 
salinity, currents and other factors at different 
depths and locations across the Bay of Bengal. 
They have also released radiosondes, which 
are instrument packages carried upward by 
weather balloons to gather meteorological data. 

“We want to observe the conditions across 
the air-sea interface cleanly, which is a chal- 
lenging thing to do,’ says Emily Shroyer, an 
oceanographer at Oregon State University in 
Corvallis, who led the first leg. The second leg 
will take another group of scientists on board 
and will wrap up by 22 July. This team will 
travel farther to the south, and an associated 
group will take measurements near Sri Lanka 
with the NARA research vessel Samudrika, 
says NARA oceanographer Priyantha Jinadasa. 

With enough data and analysis, MISO fore- 
casts could improve in perhaps five or ten 
years, says team member Debasis Sengupta, 
a monsoon expert at the Indian Institute of 
Science in Bangalore. 

Project scientists are already planning a 
second, longer season of field observations 
for next summer. Details have not yet been 
finalized, but the team will continue to tar- 
get how energy flows between the air and the 
sea during the monsoon. “There's constantly 
this game going on between the atmosphere 
and the ocean,’ says Amit Tandon, an ocean- 
ographer at the University of Massachusetts 
in Dartmouth. “It’s every bit as exciting as a 
World Cup match between two nations.” = 
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US-China trade war rattles labs 


Trump puts tariffs on Chinese technology and China retaliates with taxes on US chemicals. 


BY ANDREW SILVER 


cientific research in the United States 
could become collateral damage in the 
country’s escalating trade dispute with 
China. Both nations went head-to-head in 
mid-June over tariffs on a long list of goods 
that includes lab equipment and reagents. 
That is likely to increase the cost of scientific 
research, and the impact could be felt more 
keenly in US labs. 
The latest skirmish in the ongoing trade 
war between the world’s two largest econo- 
mies began on 15 June, when the United States 
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announced a 25% tax on 818 goods imported 
from China. The list includes equipment used 
by scientists, such as basic electrical parts, 
microscopes and geological-survey devices. 
President Donald Trump said the tariffs, which 
will start on 6 July, are intended to reduce Chi- 
nas dominance in industries such as robotics, 
new materials and information and communi- 
cations technology, and will level the playing 
field for US firms. The Trump administration 
is considering tariffs on a further 284 indus- 
trial goods, including chemicals. 

A day after the US announcement, China’s 
Ministry of Commerce responded with its own 


set of tariffs on 545 US products imported to 
China, which will also start on 6 July. The 
government will apply taxes in the future to 
another 114 US imports — including basic 
chemicals and medical devices, such as mag- 
netic resonance imaging (MRI) machines — 
although it has not announced a date. 
Scientists in the United States were quick 
to denounce Trump’s latest round of tariffs. “I 
am opposed to these seemingly ad hoc tariffs 
because it will further stretch the already 
anaemic scientific research budgets in this 
country,’ says Thomas Lapen, a geochemist at 
the University of Houston, Texas. Equipment 
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and supplies are the second-largest expense for 
his research, after paying wages. Lapen says 
that his costs are likely to increase because the 
US tariff list includes equipment or parts his 
team needs, such as electrical motors that drive 
centrifuges. 

Priscilla Cushman, a dark-matter physicist 
at the University of Minnesota in Minneapo- 
lis, says that research deans at US universities 
should be scrutinizing the list to see whether 
the taxes will affect their facilities. 


SCIENCE SQUEEZE 
The tariffs could also cause havoc for large- 
scale experiments, such as the ADMX 
dark-matter detector at the University of Wash- 
ington in Seattle, which is under construction. 
The project’s lead scientist, physicist Leslie 
Rosenberg, is worried that the equipment his 
team needs to build experiments — such as 
tools for power generation and distribution, 
and machinery that has Chinese electrical com- 
ponents — could be subject to the latest tariffs. 
“Anyone can see the tariff list, but an official 
must determine whether any particular pro- 
curement falls under the tariff? he says. 
Rosenberg thinks that the United States’ 


overall research capability will probably 
decline under the tariffs. 

But other researchers aren't worried. 
Roberto Refinetti, a biopsychologist who 
studies biological clocks at Boise State Uni- 
versity in Idaho, uses some small Chinese- 
manufactured equipment for his work, such 
as infrared motion detectors for monitoring 
rodents. He doesn't think that the tariffs on 
Chinese goods will significantly increase the 
cost of his research, because he purchases that 
type of equipment infrequently. 

The White House and the Office of the 
United States Trade Representative did not 
respond to Nature’s request for comment on 
researchers’ concerns. 

In China, the tariff dispute could increase the 
cost of standard reagents used in laboratory and 
medical devices that scientists import from the 
United States. Ruibang Luo, a bioinformatician 
at the University of Hong Kong who collabo- 
rates with researchers on the mainland, says 
that if the Chinese government interprets some 
tariff items literally, the taxes could apply to a 
broad range of US-made reagents and research 
devices, including some DNA sequencers. 

But Yu Zhou, a researcher at Vassar College 
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in Poughkeepsie, New York, who studies 
science and technology development in China, 
says that the tariffs would not have a significant 
effect on research projects and experiments in 
China. She says that is because some univer- 
sities have large enough budgets to absorb 
increased costs. Researchers could also share 
more equipment than they do now, or use 
goods made domestically and from countries 
other than the United States. 

Brian Xu, a toxicologist for the scientific 
consulting firm ACTA in Washington DC, 
which works with businesses in China, agrees 
that China’s proposal to place tariffs on US 
chemicals and scientific equipment is unlikely 
to have a major effect on Chinese research. 
He notes that scientists there import only a 
small amount of US-made chemicals, and 
that infrequently replaced scientific equip- 
ment from other countries, such as Japan and 
Germany, is of comparable quality and cost. 

But the latest round of tariffs might not 
be the last. On 18 June, Trump threatened to 
impose additional tariffs on Chinese goods 
ifthe country does not rescind its tariffs and 
create a more balanced trade relationship with 
the United States. = 


Japanese mission reaches 
unexplored asteroid Ryugu 


Hayabusa -2 will release four landing probes before touching down to collect samples. 


BY DAVIDE CASTELVECCHI 


fter travelling for three anda half years, 
A Japanese spacecraft Hayabusa-2 

this week makes its final approach 
to the asteroid Ryugu. The probe will release 
landers on the space rock’s surface later this 
year and bring a precious sample back to ter- 
restrial labs in 2020. It is already giving plan- 
etary scientists their closest-ever view of a 
mysterious kind of asteroid. 

The Japan Aerospace Exploration Agency 
(JAXA) last week released grainy pictures 
froma distance of around 300 kilometres away, 
revealing that Ryugu — an asteroid of a com- 
mon but little-studied type — looks similar to 
a spinning top. 

This week, a much more detailed picture, 
from 40 kilometres away, showed a surface 
strewn with large boulders. Hayabusa-2 will 
continue to inch towards the asteroid until it is 
about 10 kilometres away, which JAXA expects 
will happen around 27 June. Ryugu’s orbit cuts 
between those of Earth and Mars. 


“From a distance, Ryugu initially appeared 
round, then gradually turned into a square 
before becoming a beautiful shape simi- 
lar to fluorite, known as the ‘firefly stone’ in 
Japanese,’ project manager Yuichi Tsuda said 
ina 25 June statement. 

Launched in December 2014, the probe is a 
follow-up to — and near-clone of — Hayabusa, 
which explored the asteroid Itokawa starting 
in 2005. Hayabusa 


was the first mission “Ryugu initially 
to return an aster- appeared round, 
oid sample to Earth. before becoming 
Ryugu is about 1 abeautiful 
kilometre across — shape similar to 


around 3 times wider 
than Itokawa but 
one-quarter the size of the comet 67P/Churyu- 
mov-Gerasimenko, which the European Space 
Agency's Rosetta probe visited between 2014 
and 2016. 

Ryugu is a ‘C-type’ asteroid, which has a 
darker surface than does Itokawa. In 1997, a 
NASA mission called NEAR Shoemaker made 


fluorite.” 


a fly-by of a C-type asteroid from a distance 
of more than 1,000 kilometres. Hayabusa-2 is 
the only spacecraft to have come this close toa 
C-type asteroid, says Lucy McFadden, a plan- 
etary scientist at NASAs Goddard Space Flight 
Center in Greenbelt, Maryland. 

“We don’t know much about C-type aster- 
oids,” says McFadden. But they are expected to 
have a composition similar to that of the early 
Solar System. In particular, Hayabusa-2 will 
determine whether the darkness of Ryugu’s 
surface is due to it being rich in carbon — as is 
often assumed — or to small, metallic particles 
such as magnetite. 

Chemical and isotopic analyses of the rock 
— to be done in space by Hayabusa-2’s landers 
and then in terrestrial labs — could help to 
explain the origins of Earth and, particularly, 
its water. Many researchers think that Earth’s 
oceans formed from a bombardment of water- 
rich asteroids or comets. 

Among the first measurements Hayabusa-2 
made was one of Ryugu’s rotational period, or 
time it takes to make one turn on its own > 
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An artist’s impression of Hayabusa-2. 


> axis, which is about 7.5 hours. This is good 
news, because a much faster rotation could have 
made it harder to approach the surface, says 
mission manager Makoto Yoshikawa of JAX As 
Institute of Space and Astronautical Science in 
Sagamihara. But its shape was surprising, he 
says, because it has a bulge around the equator, 
something that is usually associated with much 
faster-spinning objects, Yoshikawa adds. 
Hayabusa-2’s most important task right now 


is to pinpoint its own position using laser rang- 
ing so that it can manoeuvre accordingly. “We 
want to know the exact distance of the space- 
craft to the asteroid? Yoshikawa says. Also 
crucial is to map the asteroid surface using its 
on-board camera and infrared spectrometer. 
Temperature variations will hint at the com- 
position of the surface. All of these data will 
be crucial for deciding where to release MAS- 
COT, the shoebox-sized lander that will probe 


the asteroid, and the three other small probes 
carried by the mothership. 

“We will use the information we get from the 
mother spacecraft to do landing-site selection,” 
says MASCOT payload manager Stephan 
Ulamec of the German Aerospace Center in 
Cologne. Ulamec was also project manager 
for Philae, a probe that Rosetta released onto 
the surface of 67P/Churyumov-Gerasimenko. 
That approach took several hours because 
Rosetta was orbiting the comet and Philae had 
to spiral down to its surface. 

Hayabusa-2 will simply hover over Ryugu — 
using its own gentle ion engines to counteract 
the asteroid’s gravitational attraction — and 
release MASCOT straight down. Some time 
in October, the lander will make a soft touch- 
down. After MASCOT settles on the surface, an 
internal mechanism will straighten the lander 
up so it can use its on-board instruments and 
communicate with Hayabusa-2. 

MASCOT carries no solar panels and its 
batteries are expected to last only a few hours. 
The team will meet in Toulouse, France, in 
mid-August to make the final selection for the 
landing site of MASCOT and its companions. 

Meanwhile, Hayabusa-2 will make its own, 
brief soft landings to collect samples of the 
asteroid’s surface. Then, in late 2019, it will 
head back to Earth, a journey expected to last 
a year. Compared to the more daring manoeu- 
vres to reach the asteroid’s surface, the current 
part of the mission is relatively low risk, says 
Yoshikawa. But as the craft approaches Ryugu, 
his team has already kicked into high gear, he 
adds: “I do not have much time for sleep.” m 


JAXA 


CLIMATE CHANGE 


Methane leaks from US gas 
fields dwarf official estimates 


Latest study suggests that emissions could be coming from faulty equipment. 


BY GIORGIA GUGLIELMI 


ethane leaks from the US oil and 
M gas industry are 60% greater than 

official estimates, according to an 
analysis of previously reported data and new 
airborne measurements. 

Because methane is a potent greenhouse 
gas, scientists say that the unaccounted-for 
emissions could have significant impacts on 
the climate. 

The analysis’, published on 21 June in 
Science, is one of the most comprehensive looks 
yet at methane output from US oil and gas pro- 
duction, and reinforces previous studies that 
suggested emissions outpaced government 
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estimates. That research prompted the 
government to develop regulations that would 
restrict methane emissions from oil and gas 
production — rules that US President Donald 
Trump is now attempting to roll back. 

The latest study shows that the US oil and 
gas supply chain emits about 13 million tonnes 
of methane, the main component of natural 
gas, every year. That’s much higher than the 
US Environmental Protection Agency (EPA) 
estimate of about 8 million tonnes. 

This discrepancy probably stems from the 
fact that the EPA’s emissions surveys miss 
potential sources of methane leaks, such as 
faulty equipment at oil and gas facilities, says 
study leader Ramon Alvarez, an atmospheric 


chemist at the Environmental Defense Fund, 
anon-profit group in Austin, Texas. 

Methane warms the planet 80 times as much 
as carbon dioxide does over the first 20 years 
after it is released. And atmospheric methane 
contributes to about 25% of global warming, 
Alvarez says. “That's a significant amount.” 

If left unchecked, he says, methane emissions 
from the oil and gas industry could erode the 
potential climate benefits of using natural gas, 
which releases much less CO, and other toxic 
pollutants than coal does when it is burned. 

The latest study comes one year after the 
EPA announced a delay of the rule that would 
restrict methane emissions produced by 
oil and gas drilling operations. The policy, 
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introduced under former president Barack 
Obama, will not take effect until 2019. 
Before 2012, published estimates of the 
US methane leakage rate ranged from 1% to 
about 8%, Alvarez says, and the lack of con- 
sensus pushed scientists to improve meas- 
urements of those rates in subsequent years. 
Alvarez and his team pooled data from some 
of these studies — many of which quanti- 
fied emissions at individual facilities — and 
validated the measurements using aircraft 
surveys. The team covered regions account- 
ing for about 30% of US gas production. 
The researchers then extrapolated the 
figures to estimate methane leaks at the 
national level. The team found a leakage 
rate of 2.3% in 2015, compared with the 1.4% 
estimate from the EPA. The gas is escaping 
through holes in the production system, and 
itadds up to a lot of emissions, says Alvarez. 
The findings reduce the uncertainty 
around the magnitude of US methane 
emissions, says Daniel Zimmerle, an energy 
researcher at Colorado State University in 
Fort Collins. “But I would be surprised if 
this would be the final word on the topic.” 
Because of methane’s warming potential, 
a leak rate of 2.3% is concerning, says 
Robert Howarth, an Earth-systems 
scientist at Cornell University in Ithaca, 
New York. But he cautions that the study 
might have underestimated the actual 
leak rate of methane. Howarth notes that 
the measurements scientists used include 
some obtained with an instrument that 
— according to the device's inventor — 
produces systematically low numbers’. 
What's more, Howarth says, the research- 
ers didn’t look at the emissions from sys- 
tems that distribute gas to urban areas, 
which studies suggest are considerable’. 
But Alvarez looks on the bright side. 
Because a substantial proportion of these 
leaks is probably due to faulty equipment, 
he sees a “tremendous opportunity” to 
reduce methane emissions by developing 
systems to quickly detect malfunctions 
at oil and gas facilities, and by identifying 
overlooked ways in which the greenhouse 
gas escapes into the atmosphere. = 
1. Alvarez, R. et al. Science https://doi. 
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Bacteria deliver 
gene therapies 


Engineered strains of Escherichia coli and other microbes 
are being tested in people as treatments for a slew of illnesses. 


BY SARA REARDON 


eople often take medicines to rid 
Pp themselves of problem bacteria. Now, 

a counter-intuitive approach — turn- 
ing genetically modified bacteria into 
medicines — is gaining ground. 

Several companies are testing whether 
engineered bacteria can treat conditions that 
affect the brain, liver and other organs — 
and even kill other, harmful microbes. But 
although US regulators have approved trials 
of several types of engineered bacterium as a 
form of gene therapy, questions remain about 
whether microbes’ ability to share DNA with 
one another will create long-term safety risks. 

The idea of using bacteria to deliver gene 
therapies first surfaced in the 1990s, but early 
clinical trials met with 
mixed results. Interest 
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teria that live in the 
body — the microbi- 
ome — can influence human health. Research- 
ers are looking to treat disease by modifying 
microbes that are normally found in people or 
the foods they consume. 

Matthew Chang, a synthetic biologist at the 
National University of Singapore, says that 
genetically modified bacteria have the poten- 
tial to treat many diseases. His group is engi- 
neering the gut bacteria Escherichia coli and 
Lactobacillus to recognize and destroy harm- 
ful microbes’. “It’s a rapidly growing area,” says 
Chang, who adds that he is in talks with regula- 
tors in Singapore about starting clinical trials. 

One strain of research is aimed at treating 
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the genetic disorder phenylketonuria. People 
with the condition are deficient in an enzyme 
that breaks down the amino acid phenylala- 
nine, which causes neurological damage if it 
builds up in the body. At the American Society 
for Microbiology’s annual meeting in Atlanta, 
Georgia, earlier this month, researchers from 
the biotechnology firm Synlogic in Cambridge, 
Massachusetts, reported that E. coli modified 
to produce an enzyme that degrades phenyla- 
lanine, and a protein that moves it from blood 
to cells, reduced levels of the amino acid in 
monkeys’ blood by more than half compared 
with animals in a control group. The company 
started clinical trials in healthy human volun- 
teers in April, and will begin testing the bacte- 
ria in people with phenylketonuria as soon as 
it concludes that the therapy is safe. 

Another firm, Intrexon of Germantown, 
Maryland, has altered Lactococcus lactis, 
a bacterium used in cheese production, to 
make a protein that protects skin’s outer lay- 
ers. One clinical trial that has enrolled about 
200 people with cancer is testing whether an 
L. lactis mouthwash can prevent oral sores that 
are a side effect of chemotherapy. In July, the 
company will begin dosing people who have 
diabetes with a different form of L. lactis that 
produces both the precursor to human insu- 
lin and an immune protein that enhances cells’ 
ability to respond to insulin. 

Both Intrexon and Synlogic have engineered 
their bacteria to reduce their likelihood of 
establishing colonies in the body — which 
means that patients would have to take the 
microbes regularly. But other companies are 
pursuing treatments that would create colonies 
of transgenic bacteria in the body. 

The biotechnology firm Osel in Moun- 
tain View, California, plans to seek US > 
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The Escherichia coli bacteria is being developed as a vehicle for gene therapy in people. 


> government approval this year for a Lacto- 
bacillus that has been engineered to prevent 
HIV transmission. Studies have shown that nat- 
urally high levels of Lactobacillus in the vagina 
can help to protect women against HIV’. Osel 
is attempting to enhance the bacterium’ protec- 
tive properties by modifying it to make a human 
protein that prevents HIV infection. 
Challenges remain before these engineered 


bacteria can enter the market. There is a risk 
that the microbes could pass the human genes 
they carry to other bacteria in the body, with 
unknown consequences. Several companies 
have attempted to prevent this exchange by 
altering the chromosomes of a bacterium, 
rather than its plasmids — tiny pieces of DNA 
that bacteria pass back and forth. They have 
also built in biological ‘kill switches’ to prevent 


the microbes from surviving outside the body. 

This strategy can fail, however. A group 
led by immunologist Simon Carding of the 
University of East Anglia in Norwich, UK, 
engineered” Bacteroides ovatus to treat colitis, 
an inflammation of the intestine. The group 
also edited the bacterium’s chromosome to 
make it dependent on a molecule produced 
by naturally occurring gut bacteria. 

But just 72 hours after the scientists fed the 
bacteria to mice, they found that B. ovatus had 
passed its modified gene to other microbes in 
the animals’ guts — and acquired genes that 
allowed it to live without the molecule. 

The experience caused Carding to abandon 
efforts to develop bacteria as therapies. “It’s 
potentially harmful if it’s not properly con- 
trolled,” he says. 

Synlogic, Osel and other companies say they 
have never observed this type of gene transfer, 
but agree that it is possible. “The microbes are 
extremely smart and they know how to sur- 
vive,’ says Chang. It remains to be seen, he adds, 
whether engineering bacteria to colonize the 
body or die out quickly is a better approach — 
but the answer could emerge as the current set 
of clinical trials wraps up in the next few years. m 
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Hubs of East Asian science 


A specialissue explores the booming research landscapes of 
Hong Kong, Malaysia, Singapore, South Korea and Taiwan. 


only on the giant economies and overlook other research powers. 
This special issue shines the spotlight on five strong science 
centres in East Asia: Hong Kong, Malaysia, Singapore, South Korea and 
Taiwan. This group is incredibly diverse in priorities and approaches, 
but all see science and technology as keys to their future. Together, they 
have an expanding role in the global research enterprise. 

An infographic on page 500 explores the investments that members of 
this group have made in research and development, and how they have 
raised their international standing in spending and research output. 
The breadth of their science and technology programmes is illustrated 
by profiles of ten remarkable researchers who 


S cience is booming in many parts of Asia — but it is too easy to focus 


research in South Korea and Malaysia. Physicist and presidential science 
adviser Han Woong Yeom argues on page 511 that South Korea must 
spend more on basic research and infrastructure while also address- 
ing public concerns such as health care and air pollution. Asma Ismail, 
president of the Academy of Sciences Malaysia, issues a call on page 514 
for scientists to further progress in social as well as economic areas, from 
green technology to the Halal economy. And an Editorial on page 485 
urges science officials across the world to address local needs. 

The growth of science in this region is a lure for scientists. A Careers 
Feature on page 625 traces the path of four researchers who left East Asia 
to pursue PhDs and postdoctoral research, but have now returned to 
develop their research. 


are advancing science — ranging from genome 
editing to green-energy development — and 
supporting their communities (see page 502). 

Two Comment articles chart paths for 


With their substantial investments and 
strengths, these five research centres in 
East Asia are working to secure their future as 
major forces in the global science landscape. m 
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HONG KONG, MALAYSIA, 
SINGAPORE, SOUTH 
KOREA AND TAIWAN ARE 
INVESTING HEAVILY IN 
RESEARCH AS AN 
ENGINE FOR GROWTH. 


BY RICHARD VAN NOORDEN 


Which economies invest the most in 
research and development (R&D)? The 
answer might not be what you think. 
South Korea ploughs a whopping 
4.24% of its gross domestic product 
(GDP) into science and technology — 
neck and neck with Israel, and putting 
much of Europe and the United States 
to shame. Taiwan also invests heavily, 
beating science heavyweight Japan in 
2016 in terms of the share of its 
economy devoted to R&D. 

In East Asia, several science power- 
houses are investing strongly in 
science. Although mainland China and 
Japan get much of the attention here 
— they are the area’s biggest econo- 
mies and have giant scientific workforc- 
es — South Korea, Taiwan, Singapore 
and Hong Kong have established 
themselves as strong supporters of 
research, and Malaysia is fast growing 
its scientific output. 


HUBS OF ASIAN SCIENCE 


A Nature collection 


SPENDING 


Research and development (R&D) investment is rising rapidly in South Korea, Taiwan and Malaysia 
— albeit from different bases. In two decades, South Korea has close to doubled the share of its 
economy spent on research. Taiwan’s proportion is not far behind, and it overtook Japan in 2016. 
Singapore’s spending was keeping pace with Taiwan’s, but has dropped off because of a decline in 
business R&D spending. Only Hong Kong’s investment has plateaued in the past decade or so. 
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Taiwan and South Korea’s R&D investment comes mainly from the business sector, and the 
proportions invested in basic research are accordingly lower. In Singapore, Malaysia and Hong Kong, 
businesses provide around half the R&D financing, more like the United States and United Kingdom. 
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SCIENCE WORKFORCE 


After mainland China and Japan, South Korea has the most researchers in East Asia — and 
Malaysia has grown its science workforce to a point where it is now ahead of Singapore and 
Hong Kong. But in terms of researcher density, South Korea, Taiwan and Singapore stand out. 
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TOP INSTITUTIONS 


Hong Kong, Malaysia, Singapore, 

South Korea and Taiwan boast some of 
the world’s leading institutions, according 
to international rankings and publication 
statistics. And many of their institutions 
have made impressive gains on rankings 
of citation impact over the past decade. 


To identify large leading universities, 
Nature charted institutions that published 
more than 4,500 articles in 2015-17, and 
whose papers were cited at least 30% 
more than the world average. 


FEMALE RESEARCHERS 


Female researchers are under-represented in 
many Asian economies, as they are across the 
world. But almost half of Malaysia’s researchers 
are female, and the United Nations says that 
Malaysia is a world leader in encouraging girls 
and women to participate in science. 
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ARTICLE OUTPUT 


South Korea’s research output had soared to 
some 65,000 research articles in the Scopus 
database last year. (By comparison, mainland 
China produced more than 414,000 articles and 
scientists in Japan published 89,000). Taiwan’s 
output is dipping as a proportion of the world’s 
research, but Malaysia’s volume is rising fast. 


In data collected by 
Scopus, shown here, 
Malaysia has 
overtaken Hong Kong 
and Singapore. But 
3 — the Web of Science 
database has them 
all about level at 
about 0.8%. 
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CITATION IMPACT 


Singapore and Hong Kong have stretched their 
lead over the United States and United Kingdom in 
terms of the average scholarly impact of their 
publications — with normalized citations far above 
the world mean. One reason is that these 
economies have very high rates of international 
collaboration, which is linked to increased citations. 
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Science stars of 


From artificial intelligence to 
infectious diseases, top researchers 
in Hong Kong, Malaysia, Singapore, 
South Korea and Taiwan are making 

big impacts on the global stage. 
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CANCER DETECTIVE 


A geneticist works to improve breast-cancer 
screening among women in Singapore. 


BY SANDY ONG 


a lot like diving because it’s about looking for things that are hid- 
ing in plain sight,” says Li, an avid diver and a human geneticist at 
the Genome Institute of Singapore. In her research, Li sifts through 
information collected from women with breast cancer to detect the risk 
factors that make them more susceptible to developing the disease. “All 
the data are there — it’s about forming a research question,’ she says. 
To make predictions about whether an individual has a high risk for 
breast cancer, Li looks for genetic markers as well as lifestyle factors, 
such as obesity, that are linked to an increased risk of cancer. She 
also incorporates a third component into her risk assessment, using 
mammogram images to study breast density. Searching through this 
combination of data is unusual, says medical epidemiologist Per Hall 
at Stockholm’s Karolinska Institute, who was Li’s PhD supervisor. “She’s 


J Li does some of her best thinking underwater. “Research is 
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one of the new generation of epidemiologists,’ says Hall, who adds that 
risk models are getting better but still require a lot more work. 

Li studied the genetic profiles of hundreds of thousands of European 
women to search for disease markers. Specifically, she looked at the 
most common type of genetic variation among people — changes in the 
individual ‘letters’ in DNA, known as single nucleotide polymorphisms 
(SNPs). Unlike mutations in the BRCA1 and BRCA2 genes, which can 
substantially increase the risk of developing breast cancer, variations in 
individual SNPs do not raise a woman's chance of getting the disease by a 
large margin. However, collective changes to numerous SNPs can elevate a 
womans risk significantly (K. Michailidou et al. Nature 551, 92-94; 2017). 

After moving back to Singapore in 2017, Li switched her focus to 
working with local data. Breast cancer is the most common cancer 
affecting Singaporean women, but they are reluctant to discuss the 
disease and to get regular screenings, says Li. “I think genetics is the 
way to go,” she says. “It doesn't solve everything, but at least you can 
identify the high-risk population.” Physicians can then concentrate on 
getting them screened for the disease. 

“Personally, I have always thought that if you really want to imple- 
ment a change, Singapore is the place to do it because the country is so 
small and compact,’ Li says. She is also keen to tackle breast infections 
in older women, a problem more commonly observed in Asia than in 
the West. “The whole purpose of my work,’ she says, “is to improve 
womens lives.” m 
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THE SENTINEL 


An infectious- disease specialist in 
Hong Kong battles emerging pathogens. 


BY DAVID CYRANOSKI 


in China's Guangdong province, and then spread to Hong Kong and 

eventually the world. Severe acute respiratory syndrome, or SARS, as 
the disease came to be called, “came out of nowhere’, says Malik Peiris. It 
eventually killed almost 800 people in more than two dozen countries. 

SARS’ toll could have been much worse had it not been for Peiris, 
one of the top specialists in emerging diseases. Despite concerns that 
the virus could kill them or their families, Peiris and his team quickly 
pinned down the cause and developed measures to control it. 

Peiris, a virologist at the University of Hong Kong’s School of Public 
Health, is a much-sought collaborator who can navigate political issues 
that arise during outbreaks. “He is quiet, and his personality is very con- 
ducive to collaboration between international scientists of many cultures,” 
says Robert Webster, a virologist at St Jude's Children’s Research Hospital 
in Memphis, Tennessee, and a pioneering researcher of avian influenza. 

Before coming to Hong Kong, Peiris bounced back and forth between 
Sri Lanka, where he was born and completed an undergraduate degree, 
and the United Kingdom, where he did postdoctoral research and a stint 


E late 2002, a mysterious and debilitating respiratory illness emerged 


AIR WARDEN 


with the National Health Service. 
In 1995, Peiris took on the challenge of establishing a clinical-virology 


A Taiwanese pollution expert tackles indoor threats. 


BY NICKY PHILLIPS 


Taiwanese politicians with a radical 

proposal. Su, a specialist in indoor air 
pollution, had led a team that had spent 
more than a decade measuring levels of con- 
taminants in people’s homes and workplaces 
and then tracking the health impacts. Then 
she brought the evidence to the government 
and asked it to consider setting air-quality 
standards for indoor environments. In 
2005, Taiwan proposed limits for the levels 
of certain indoor pollutants, and in 2012 
the government introduced laws to regulate 
indoor air quality. 

“T continue to be proud that when the 
standard was issued we had a huge mass of 
scientific evidence that was so solid that no one 
could challenge the decision,’ says Su, who in 
2015 became the first woman president of the 
National Cheng Kung University in Tainan. 


E 1999, Huey-Jen Jenny Su approached 
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Su has a talent for getting people to stop 
posturing and start focusing on the task at 
hand, says John Spengler, an environmental- 
health researcher at the Harvard T. H. 
Chan School of Public Health in Boston, 
Massachusetts, who was Su’s doctoral super- 
visor. “That’s why she’s been so effective.” 

Su has long been interested in how 
pollution affects people. After completing 
her PhD, she spent a year working on a major 
US government research project that sent her 
into many homes, where she saw first-hand 
the poor living conditions that some people 
endure. This field experience, she says, “gave 
me a lot of inspiration after I returned to 
Taiwan about how to effectively link scientific 
findings to policy”. 

Back home, Su and her team revealed 
how Taiwan's climate generated a particular 
cocktail of indoor pollutants. They found that 
many homes in the hot and humid south had 
high concentrations of allergens, such as dust 


mites and airborne fungi, above the levels 
known to cause respiratory problems. 

They also found that Taiwan’s subtropical 
environment had higher concentrations of 
endotoxin — a component of bacterial cell 
walls linked to asthma severity — than did 
more temperate climates. Su and her col- 
leagues later demonstrated that living in a 
home with visible mould and water dam- 
age was associated with asthma in adults 
and children (N.-Y. Hsu et al. Arch. Environ. 
Occup. Health 67, 155-162; 2012). 

Since becoming university president, Su 
has focused her research on how data on air 
pollution and its health impacts can help 
efforts to make society more resilient to 
global warming. “What choices do we have 
to help people and our built environment 
be more adaptive to the changing climate?” 
she says. 

Su is proud that the government standards 
she helped to develop are protecting people's 
health, especially those with limited finan- 
cial resources. “Someone who is committed 
to public health should always keep in mind 
those people who are disadvantaged,” she 
says. “That has been my core value and 
principle all these years.” = 
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laboratory in Hong Kong. He was put to the test two years later, when 
Hong Kong recorded the first known case of the H5 strain of bird 
flu jumping to people. Eventually, 18 became infected and 6 died. 
The outbreak terrified infectious-disease experts because H5N1 was 
known for wreaking havoc on bird populations. 

Peiris made his name with H5N1. He discovered how the virus 
kills people by triggering an overreaction in the immune sys- 
tem — known as a cytokine storm — that attacks the body’s organs 
(C. Y. Cheung et al. Lancet 360, 1831-1837; 2002) . He also showed 
that interventions in live-poultry markets — such as closing the 
market for one or two days every month, or removing the poultry 
overnight — dramatically reduced the spread of the virus in the 
market, thereby reducing risk to people. 

The H5N1 scare helped Peiris and others to prepare Hong Kong 
for future outbreaks. “The system set up in response to bird flu is 
what helped us to deal with SARS,” he says. 

Six years later, Peiris’ scientific reputation burgeoned through the 
SARS crisis. He led a team that discovered that SARS was caused bya 
coronavirus, and developed strategies to contain the virus. 

Lately, Peiris has shifted his attention to Middle East respiratory 
syndrome (MERS), another coronavirus disease with pandemic 
potential, which he says isn't getting enough attention. Evidence 
suggests that the virus has been crossing from camels to humans 
— mainly in Saudi Arabia — in a way that reminds Peiris of local 
SARS outbreaks before the disease spread globally. “It has the greatest 
capacity for huge global impact,’ he says. 

Although his scientific reach is now global, Peiris says his location 
in Hong Kong remains crucial to catching emerging threats. The city 
has long been a crossroads of diseases that go global. “For a career 
in the type of things I’m doing, he says, “it’s a perfect place to be.” m 
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POWER PLAYER 


A chemical engineer turns waste into fuel. 


BY YAO-HUA LAW 


she arrived at the University of Leeds, UK, in 1992. She had 

received a scholarship from the Malaysian government to do 
an undergraduate degree in chemical engineering, but, she says, 
“T didn't know what chemical engineering was’. 

She quickly grew fond of the field for its mix of science and practi- 
cal applications — so fond that she received a PhD in chemical engi- 
neering from the University of Bradford, UK, in 1998. The eldest of 
seven children, Yusup is the only academic in her family. “My parents 
wanted me to be a doctor, any doctor, but they knew I was afraid of 
blood. So for them, this [degree] is something to be very proud of” 

Yusup now heads the Center for Biofuel and Biochemical Research 
at Universiti Teknologi Petronas (UTP) in Perak, Malaysia. She 
joined the university in 2001 and began exploring Malaysia's rich 
plant resources to produce biofuels. Much of her work has focused on 
creating fuels from biomass waste, such as used cooking oil, rubber- 
seed oil and discarded distillate from palm-oil refineries. 

She has long explored ways in which green technology can help the 
environment and society. When she learnt that rice farmers around 
her university were struggling with pests, weeds and the health threat 
of chemical pesticides, Yusup started developing safer biopesticides 
based on compounds from plants. She adapted the techniques and 
facilities in her biofuel lab to produce the pesticides. Yusup has 
won many honours, including Malaysia's Rising Star Award in the 
category of Highly Cited Review in 2016 and the Women in Science 
Award in 2017. 

Yusup’s latest research interests reflect her hobby of gardening. 
In particular, she worries that she might be too weak to plough the 
soil after she retires. “I am an engineer and I should do something 
about it,” says Yusup. She developed a hydroponic system that is soil- 
free and requires no ploughing, then got her engineering students 
to teach it to schoolchildren. Her students complained, but quickly 
turned enthusiastic after their first harvest. When researchers see the 
impact they have on others, it helps motivate them to find solutions 
to problems, says Yusup. “That’s what keeps me going.” m 


uzana Yusup didn’t have a choice in her course of study when 
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GRAPHENE GROWER 


A physical chemist takes a trick from frogs to develop 
better ways to grow graphene. 


BY SANDY ONG 


s aboy growing up in Singapore, Loh 
A Kian Ping fell in love with science 

after seeing specimens of dissected 
frogs. Decades later at the National Univer- 
sity of Singapore, the physical chemist once 
again drew inspiration from frogs — this 
time, to make one of his key discoveries. By 
mimicking how the amphibians latch on to 
leaves, Loh developed a method to connect 
graphene to silicon wafers, which could help 
in industrial applications such as improving 
optical communications. 

With its single layer of carbon atoms, 
graphene is hailed by the microelectronics 
industry for its thinness and its ability to 
conduct electricity. But growing graphene 
ona surface is tricky. If the conditions are 
wrong, the thin layer of carbon peels off and 
floats away in solution. Or too many carbon 
layers grow and create graphite instead. 

Loh and his colleagues devised a way to 


grow graphene on copper-coated silicon 
wafers, then etch away the copper (L. Gao 
et al. Nature 505, 190-194; 2014). To pre- 
vent graphene from washing away as well, 
Loh made special pretreated silicon wafers 
that attach to the newly grown graphene 
through capillary bridges — the same 
microstructures that allow frogs to adhere to 
lotus leaves. “It’s a biomimetic way of attach- 
ing graphene to silicon wafers,’ says Loh. 
Successfully growing graphene on 
silicon microchips is a crucial step towards 
using the material on a commercial scale 
in industry to improve the performance 
and cost of electronic devices, says chemist 
Xiangfeng Duan at the University of Cali- 
fornia, Los Angeles. Loh’s achievement, he 
says, helped to address a key bottleneck. 
Loh has been studying graphene and 
other 2D materials for more than a decade, 
and has earned a reputation as a master 
grower of these super-thin nanomaterials. 
It’s a highly competitive field, but Loh 
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“displays a rare combination of many 
unique skills”, says Duan. He adds that Loh’s 
training as a physical chemist allows him to 
bring perspectives from both fields. 

He also brings insights from around the 
world. Loh did his PhD at the University of 
Oxford, UK, and his postdoctoral research 
at the National Institute for Materials Sci- 
ence in Tsukuba, Japan. He chose to return 
home in 1999 because, he says, “the funding 
situation in Singapore is very healthy” and 
he felt the pull of his country. “Ultimately, 
its home.’ His work has paid off there, earn- 
ing him the 2014 President's Science Award, 
the highest recognition given to researchers 
in Singapore. 

Next, Loh wants to focus on translating 
his research into useful products that will 
help contribute to his homeland. “If you 
just focus on publishing papers, there’s no 
value capture for Singapore,’ says Loh. 

Already, his work on graphene and other 
super-thin materials has found several 
environmental applications, ranging from 
green alternatives to metal catalysts (C. Su 
et al. Nature Commun. 3, 1298; 2012) to 
organic solar cells. “When you study 
2D materials, they always surprise you 
because new properties appear,’ says Loh. 
“Tt's a very rich playground for intellectual 
discovery.’ m 
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INFECTION FIGHTER 


A parasite researcher in Malaysia works to 
improve the lives of indigenous people. 


BY YAO-HUA LAW 


in human faeces. As part of her PhD project in the 1990s, Lim 

collected samples from the indigenous people of Peninsular 
Malaysia in the west of the country and checked for signs of parasitic 
protozoa. As she went from hut to hut, her study subjects called out, 
“The stool person is coming!” She made fun of her title — and that 
endeared her to those she was studying. 

Two decades later, Lim is nowa parasitologist and deputy dean in the 
Faculty of Medicine at the University of Malaya in Kuala Lumpur, where 
she continues to study parasitic helminth worms among the indigenous 
tribes, known collectively as the Orang Asli. She has found that a heavy 
rate of helminth infections persists among only certain populations, 
generally in places hit by poverty and poor sanitation. Working out why 
is a complex problem, and remains a focus of Lim’s work. 


é vonne Ai Lian Lim has developed a reputation for her interest 


She reaches into her drawer and pulls out a pouch woven from leaves. 
“The handiwork is excellent, it’s so fine!” says Lim of the pouch, which 
was produced by one Orang Asli tribe. Members of that group have 
boosted their income by selling their craftwork, says Lim, who has con- 
nected with lawyers, economists and artists to help the Orang Asli to 
market their products. 

Her work is also having a scientific impact. In 2016, she received atop 
research scientist award from the Academy of Sciences Malaysia, and 
co-authored a paper in Science (D. Ramanan eft al. Science 352, 608-612; 
2016) that is in the top 1% of the most-cited papers in immunology for 
2016. The study revealed a flip side to helminth worms: that a low-level 
infection protects people against inflammatory bowel disease by regulat- 
ing the immune system and gut bacteria. The work has won Lim and her 
collaborators a four-year grant to study helminths as therapeutic agents. 
“We always think of the bad effects of the worms,” says Lim. But when 
the worms are present in low levels, she says, they “can be angels too”. 

Helminth infection levels among the Orang Asli, however, are much 
higher and inflict a heavy toll. Lim says she will continue gathering 
information and pushing the issue with authorities until the problem is 
solved. The work has a personal dimension for Lim, who looks back on 
her doctoral work among the Orang Asli as a life-changing experience 
that keeps drawing her back to the region. She learnt from them, she 
says, “that life need not be so complicated, that life can be simple, that 
the richest people in the world are those that need the least: m 
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METAL MAESTRO 


A chemist looks for new ways to make molecules shine. 


BY DAVID CYRANOSKI 


hen Vivian Yam turns on a light, 
she sees a problem — and an 
opportunity. Much of the elec- 


tricity produced globally goes into lighting, 
and a large portion of that energy is wasted 
by inefficient bulbs. But many thriftier lights 
contain environmentally hazardous materials 
such as mercury. 

Yam designs organic light-emitting diodes 
(OLEDs) to produce cheap and environmen- 
tally friendly options. “If we can improve the 
performance of these devices, we can save a lot 
of energy,” she says. 

Light has been a running theme in Yam’s 
career. The University of Hong Kong chemist 
has spent more than two decades creating 
metal-containing compounds with unique 
abilities to absorb and emit light. Such 


technologies could be used, for example, 
to harness solar energy, sense early signs of 
Alzheimer’s disease in people’s brains and 
create various types of OLED display. Her 
original approaches have earned her vari- 
ous accolades, including being the youngest 
person elected as a member to the Chinese 
Academy of Sciences. 

It was a broken thermometer that helped 
to spark her early interest in chemistry, says 
Yam, who played with the spilt mercury and 
marvelled at how it flowed and coalesced into 
balls. She is still experimenting with metallic 
elements, but now at a molecular scale. 

By manipulating the forces that hold 
molecules together, Yam can control their 
alignment. This, in turn, allows her to create 
larger, complex structures called supra- 
molecular assemblies that have a unique ability 
to emit or absorb light. “Just the way they are 


GENOME EDITOR 


A South Korean researcher who helped to 
develop CRISPR wants to improve crops. 


BY MARK ZASTROW 


Kim an idea for his next set of experiments. As a pioneer of gene- 

editing techniques, Kim has felt constrained by tight restrictions 
in South Korea on using biotechnology techniques to improve crop 
yields and improve resistance to disease. So he is thinking of doing 
experiments where there is less red tape, and where the government 
has an incentive to invest in resilient crops: North Korea. 

The country has frequently struggled with famine tied to poor 
harvests. “Maybe this technology can be adopted very quickly to solve 
the current problem in North Korea, and that’s exciting,” he says. 

Kim is no stranger to forging new paths. In the late 1990s, he had a 
premier position leading a team at the Samsung Biomedical Research 
Institute in Seoul. He was working in the nascent field of gene editing 
but felt trapped by the bureaucracy of a giant corporation. “I thought, 
‘Why work for Samsung if I can do this myself?” 

So, in 1999, he formed a company called ToolGen to develop his own 
genome-editing technology. It was a bold decision in a country that 
at the time had almost no venture-capital firms investing in biotech 
start-ups. The company struggled for financing in its early years, and 
at one point shrank from 28 employees to just 5. 

Today, ToolGen is stable, with a market capitalization approaching 
US$1 billion. Kim, who left the company in 2005 for a post at Seoul 


Ts détente between North and South Korea has given Jin-Soo 
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National University, remains a shareholder and licenses his group's 
gene-editing technologies to it. 

Among them are CRISPR-Cas9, the easy-to-use ‘genetic scissors’ 
that enable researchers to precisely cut and paste genetic material in 
living cells. In January 2013, Kim was one of the first to stake a claim to 
it, publishing a paper (S. W. Cho et al. Nature Biotechnol. 31, 230-232; 
2013) that showed how the tool could be used to snip at targeted spots 
ofa DNA sequence. He holds CRISPR patents in several countries. 

Kim continues to work at the frontier of gene editing, and is now 
jointly affiliated with South Korea's Institute of Basic Science. His lab’s 
results include using the technology to edit the genomes of plants and 
to create pigs with extra muscle. And Kim is working to find therapeutic 
uses in humans. In 2017, he teamed up with Shoukhrat Mitalipov at the 
Oregon Health & Science University in Portland in an attempt to repair 
a mutation that can cause heart failure in humans, and then to insert 
the corrected gene into viable human embryos (H. Maet al. Nature 548, 
413-419; 2017). 

Therapeutic uses could be many years away, but Kim is optimistic that 
gene-edited crops could arrive sooner, with improved yields and resist- 
ance to disease. South Korea's regulation of such crops is unclear, how- 
ever, and they might fall under strict guidelines for genetically modified 
organisms, even when no foreign genetic matter has been introduced. 

Kim is considering collaborating with researchers in North Korea, 
which might be more receptive to editing genes in crops. By law, South 
Koreans are forbidden from seeking contact with North Korea. But Kim 
hopes that the current thaw in diplomatic relations will foster scien- 
tific cooperation. For example, students from both nations could work 
together in unified labs, with North Koreans returning home to train 
others. “I think they can do very well — they can catch up very rapidly,’ 
Kim says. 

Wherever he works, Kim is intent on driving the process. “I want to see 
the products,’ he says. “I want to see at least one thing in my lifetime? m 
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packed makes such a dramatic difference in 
their optical properties,” she says. 

Cost is a key consideration in her science. 
To develop phosphorescent OLEDs, known 
for their high efficiency in converting light 
into electricity, she bypassed the materials 
most commonly used by others: iridium and 
platinum. Instead, she went with gold, which 
is more abundant, environmentally benign, 
and cheaper. She eventually made the world’s 
first gold-based phosphorescent OLEDs 
(V. K-M. Au et al. J. Am. Chem. Soc. 132, 
14273-14278; 2010). 

OLEDs light up the screens of many 
electronic devices, from mobile phones to 
televisions, so Yam’s work caught the attention 
of TCL, a firm based in neighbouring Guang- 
dong province and one the world’s largest 
producers of TV sets. 

The company established a joint laboratory 
with Yam at the University of Hong Kong 
to develop gold-based versions of printable 
OLED materials, currently a hot field in the 
é TV industry. If Yam’s work pans out, her inven- 
tion could light up the lives of people across 
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PROGRAMMING PIONEER 


A computer scientist seeks to create a spoken 
version of Google. 


BY NICKY PHILLIPS 


to learn a language or study ancient history. Instead, he’s training algorithms to 
extract key words and phrases from audio and video recordings. 

He and many other computer scientists say that this is one of the crucial steps 
towards unlocking a vast amount of knowledge that is currently difficult to search and 
organize. “My whole career has been motivated by identifying interesting problems 
which can have great impact if solved, but are very difficult, he says. 

Lee, a computer engineer at the National Taiwan University in Taipei, trained as 
an electrical engineer, completing his PhD at Stanford University in California in the 
1970s. In the United States, he also worked on satellites used for telephone communi- 
cation. “When I decided to return home, all my friends said, ‘you are stupid because 
there’s no work on satellites in Taiwan,” he says. 

But Lee felt that his familiarity with voice signals presented an opportunity: he 
traded satellites for computers, and started developing a tool that could recognize 
spoken Mandarin. Over more than a decade, his team built a device that was crude and 
slow — it took five to six seconds to recognize a single syllable — but a huge achieve- 
ment. It was the first speech-recognition device for Mandarin. By 1995, their machine 
could transform continuous Mandarin speech into a series of Chinese characters. 

“Speech recognition is a very hard problem,” says Jim Glass, a researcher in speech 
processing at the Massachusetts Institute of Technology in Cambridge. Everyone in 
the world speaks slightly differently, in a continuous stream, often against a noisy 
background, he says. “Somehow we have to extract information from that,’ says Glass. 
“Lin-Shan is a pioneer in Mandarin spoken-language processing.” 

Lee is now building on that success to tackle another speech-recognition challenge: 
retrieving spoken content from audio and video files. His team is using machine 
learning to build a system in which users can search for specific words, phrases or 
sentences within a video. “I call this a spoken version of Google,” he says. It would 
revolutionize multimedia sites such as YouTube. Currently, there is no way to locate 
a specific sentence within the video without going through the entire clip. Although 
there are some existing technologies that can process speech in multimedia, they are 
not very accurate, says Lee. “It is my research goal at the moment to develop more- 
accurate technology for this.” = 


| in-shan Lee spends a lot of time exploring online courses, but he isn’t looking 
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RNA EXPLORER 


A biochemist investigates small RNA 
molecules that regulate gene expression. 


BY MARK ZASTROW 


hen Narry Kim took her first faculty position in 2001, she 
Wi: no funding, no students and no experience in her newly 

chosen field — investigating a class of recently discovered 
molecules called microRNAs (miRNAs). 

It was a leap into the unknown for her, establishing her academic 
independence by leaving behind the fungi and viruses she had studied 
in graduate school. “It was kind of scary at the beginning. But looking 
back, it was a great decision,” Kim says. 

Now, as she sits in her well-funded lab at Seoul National University, 
she waxes nostalgic for those times. Her lack of money forced her to 
be creative, designing cheap experiments that directly attacked the key 
unanswered questions of how miRNAs form and what they do. 

The first examples of miRNA were discovered in 1993. Much shorter 
than the messenger RNAs that translate genetic code from DNA to pro- 
teins, miRNAs have a role in blocking gene expression. But, as Kim 
began her work, the origins of the tiny stretches remained mysterious. 
In 2002, in her first paper as a group leader (Y. Lee et al. EMBO J. 21, 
4663-4670; 2002), she and her small team showed that they are gener- 
ated in two steps — first in the nucleus, then with further processing in 
the cytoplasm into mature miRNA. She followed that up with another 
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paper the next year detailing a major pathway through which miRNA 
is formed (Y. Lee et al. Nature 425, 415-419; 2003). 

“She and her team have made tremendous contributions,” says Ling- 
Ling Chen, an RNA researcher at China’s Shanghai Institute of Biochem- 
istry and Cell Biology. “As a scientist, Narry is precise and dedicated.” 

Today, Kim is one of South Korea’s most prominent researchers, and 
has a joint affiliation with the Institute for Basic Science, the country’s 
premier network of blue-sky research labs. In 2009, at the age of 39, she 
became one of the youngest winners of the Ho-Am Prize for Medicine, 
often considered as South Korea's equivalent to the Nobels. 

Along the way, she has also become a role model for young scien- 
tists — especially to women, who make up just 19% of South Korea’s 
scientific workforce (see infographic, page 500). 

Kim recalls a moment that could have ended her career before it had a 
chance to take off. While on a year-and-a-half-long break from research, 
she started a family and moved to small towns for her husband’s jobs. 
“There wasnt any library to read journals and no lab to work,” she says. 
And she felt the deck was stacked against her in the country’s academic 
culture. “That was in the late 90s, and for women, even if they excelled 
in science, it was very difficult to find a permanent position,” she says. 
“I was losing hope.’ She decided to quit, and began studying for law 
school. “But it was so boring,’ she says. “And I wanted to go back to the 
lab so desperately,” 

Although RNA research has plenty of possible therapeutic applica- 
tions, Kim says she has too little time to explore them, and firmly intends 
to take her mandatory retirement from academia at 65. “I have other 
things to do’, she says. One goal is to write a novel with multiple time- 
lines that show how science and technology change the course of human 
history. She sees that as a way of extending her impact beyond academia. 
“Storytelling is always more powerful.” = 


17.77 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


SHIN WOONG-JAE FOR NATURE 


CHUNG SUNG-JUN/GETTY 


COMMENT 


Priorities for science, Israel’s new The influence of Al could 
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A piloted walking robot developed by robotics company Korea Future Technology in Gunpo. 


Restructure science 
in South Korea 


To build on the success of its centralized research agenda, the nation must 
switch to projects led by independent principal investigators, urges Han Woong Yeom. 


and floundering. In some ways, things 

could not be better. National spending 
on research and development (R&D) by 
industry and government was 4.24% of gross 
domestic product (GDP) in 2016 — the 
second-highest percentage for any country 
worldwide (Israel was first, with 4.25%). For 


. outh Korean science is both flourishing 
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three decades, the government has invested 
billions of dollars each year in high-tech 
industry, turning South Korean electronics 
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companies such as Samsung, LG and 
SK Hynix into world leaders. 

Academic research is booming, in 
terms of the numbers of papers published 
and citations: in 2016, the nation ranked 
12th and 13th, respectively (see go.nature. 
com/2jwxzcc). Institutes and facilities are 
being built. In 2011, the government 
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> set up the Institute for Basic Science, 
a national network of research centres 
mimicking Germany’s Max Planck 
Society. In 2017, it launched a world- 
class X-ray free-electron laser facility 
in Pohang’, and in 2021, it will opena 
heavy-ion accelerator in Daejeon. 

Yet many researchers are dissatis- 
fied. In 2017, biologist Won-Kyung Ho 
at Seoul National University initiated a 
nationwide movement calling for more 
funding for projects proposed by indi- 
viduals. In 2017, just 6% of the national 
research budget went to projects led by 
small research groups. The rest went to 
big, government-directed ‘top-down 
projects in strategic areas such as infor- 
mation technology, robotics, materials 
and biotechnology. This imbalance was 
arguably the biggest scientific issue dis- 
cussed during the presidential election 
campaign last year. 

Something is wrong with the South 
Korean R&D system, and everyone 
knows it. The country invests a lot but 
gets less and less back. Scientists feel 
disenfranchised by the government's 
opaque system for funding. The public 
is not seeing solutions to pressing 
problems, such as air pollution. Even 
government ministers and economists 
complain that all those highly cited 
papers are not generating enough new 
technologies. 

But that is not so surprising. An 
approach that gets a nation from science 
infancy to adolescence might not be the 
best to take it to maturity. Now that South 
Korea’s high-tech industry is world-leading, 
it no longer needs to be directed by the state. 

The country’s R&D is at a turning point. 
It needs a new strategy. South Korean 
industry must change from fast follower to 
first mover. Government R&D must pave 
the way. But I think that backing certain 
industries and technologies is no longer 
the best way to do it. A better bet is long- 
term support of basic science, investment 
in infrastructure and human capital, and 
addressing public issues. 

As vice-chair of South Korea’s Presidential 
Advisory Council on Science and Technol- 
ogy, here I outline the three main challenges 
and some steps taken in response. I call for 
urgent discussions among scientists, indus- 
trialists and policymakers to forge a more 
effective path. 


THREE PROBLEMS 

Bottom-up basic research is underfunded. 
In 2017, only US$1.2 billion of South Korea’s 
$18 billion of government research fund- 
ing went to bottom-up (independent) basic 
research projects by individual investiga- 
tors and small groups (see ‘Budget break- 
down’). Eighty per cent of these projects 
received less than $50,000 per year, which 
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BUDGET 
BREAKDOWN 


In 2017, most project-based science 


by the government, and much of that 
went to applied science. 


Military, universities, small companies and so on 9 bn 


is insufficient to fund a globally competitive 
study. Just 11% of grants proposed by mid- 
career researchers were accepted last year. 
This is much less than the roughly 30% 
acceptance rate that is standard in the 
United States, the European Union and 
Japan. 

This is mainly the result of previous 
governments’ policies, which focused on 
strategic areas and the development of 
emerging technologies, industries and mar- 
kets to fuel economic growth. For example, 
between 1999 and 2013, $1.4 billion went 
to 16 projects in the 21st Century Frontier 
R&D Program. These ranged from nano- 
materials and nanodevices to proteomics 
and hydrogen fuel. Over the 2010-23 
period, ten projects in the Global Frontier 
Program run by the Ministry of Science and 
ICT will consume similar budgets. Topics 
range from flexible electronics to biomedi- 
cal technology. 

This approach has produced a two- 
tier R&D system. Little has been spent on 
research infrastructure in universities, and 
young researchers find it hard to obtain 
start-up grants. By contrast, some leading 
Chinese universities offer start-up grants of 
$20 million to promising researchers in areas 


labs 2.7 bn 


6% went to proposals 


spending in South Korea was directed 


such as condensed-matter physics — ten 
times more than a South Korean scientist 
might expect. Unsurprisingly, many tal- 
ented young physicists from South Korea 
now work in China. 

R&D is inefficient. South Korea pro- 
duces many patents, but few innovations. 
For example, in 2016, South Korea ranked 
third in the world for the number of pat- 
ents it filed, after the United States and 
Japan’. Yet the Netherlands, which ranked 
sixth and has one-quarter of South Korea's 
R&D budget, earned five times more in 
technology payments. South Korea scores 
low in terms of its ‘innovation potential’ 
(its ability to create and use innovations), 
which includes factors such as the quality 
of its research institutes, workforce and 
collaborations between universities and 
industry. The World Economic Forum's 
Global Competitiveness Report 2017-18 
lists South Korea as 26th in the world, 
behind New Zealand (13th), Taiwan 
(15th), Malaysia (23rd; see page 514) and 
Ireland (24th)’. 

Part of the problem is poor govern- 
ment leadership of R&D projects. Gov- 
ernment-driven projects are often badly 
designed and can have unclear goals that 
fall below the standards used in industry. 
For example, the country’s next-gener- 
ation superconducting technology pro- 
ject began in 2001 without a clear idea of 
market needs, and ended up without any 
serious research output. None of the test 
products developed by the 21st Century 
Frontier R&D projects had a substantial 
market impact by 2013, when the initia- 

tive ended. 

Unsurprisingly, industrialists, policy- 
makers and the public are dissatisfied 
with the outcomes. Academics feel that 
mega-projects produce fewer papers and 
citations than would all the small, independ- 
ent projects that could have been funded. 
International collaborations and the infra- 
structure needs of the science community 
are being neglected. 

Public concerns are not being met. South 
Korea’s citizens feel that they are paying for 
science that is not addressing their problems. 
Severe air pollution is the main example. 
South Korea's air has the highest concentra- 
tion of fine particulates (PM,, and PM, ;) out 
of the 34 countries in the Organisation for 
Economic Co-operation and Development: 
triple that of the United States and twice that 
of Japan. Neither the government nor the 
scientific community collects proper data 
on air quality or health impacts. 

In South Korea, the proportion of people 
aged over 65 is increasing at the high- 
est rate in the world. Conditions such as 
Alzheimer’s disease have become a major 
social issue. Yet there is no clear national 
R&D strategy for this. 

Several crises have further undermined 
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An engineering student at Sung Kyun Kwan University near Seoul inspects a wet-tolerant adhesive 


a 
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patch inspired by the suckers found on octopus tentacles. 


the public’s trust in the science system. 
One was the magnitude-5.4 earthquake 
in my home city of Pohang in November 
2017, which was possibly caused by a geo- 
thermal power plant injecting water into a 
fault zone*. Another was the deaths of more 
than 100 people — including children and 
their mothers — which were linked in 2011 
to inhalation of toxic disinfectants that 
had been sold for use in home humidifiers 
since 2001. In both cases, scientists and the 
government were criticized for dodging 
their responsibilities. 


POLICY DRIVES 

The current government acknowledges all 
of these problems. Within two months of 
winning the election in May 2017, it pro- 
posed policy changes in three areas over 
the next five years. These are welcome, but 
leave many issues unaddressed. 

First, the government will double the 
budget for bottom-up research propos- 
als, reaching $2.2 billion in 2021. This is 
a good start. But universities and national 
labs need improvements to infrastruc- 
tures, core facilities, technical capacities 
and research administrations. Students 
and postdoctoral researchers need more 
financial support. Addressing all of these 


areas could require as much funding again. 
Second, more government initiatives 
will focus on societal issues, such as air 
pollution, earthquakes, chemical hazards, 
infection, climate change, Alzheimer’s 
disease and renewable energy. However, 
this doesn't solve the inefficiency problem 
with government- 


“Something is led programmes. I 
wrong with the am already seeing 
South Korean poorly designed 
R&D system, and badly directed 
and everyone projects being set 


up, such as those 
on environmental 
issues and emerging technology. Late last 
year, South Korea's president, Moon Jae-in, 
called for improvements to the planning 
and organization of R&D projects, but these 
have yet to take effect. 

Third, industrial support needs reform. 
It is unclear which directions will most 
benefit the economy and where top-down 
involvement will be most effective. The cur- 
rent government is betting on the ‘fourth 
industrial revolution to generate com- 
mercial opportunities and quality jobs in 
artificial intelligence, data science, ultra- 
fast mobile communication, the Internet 
of Things, smart cities, bioinformatics and 


knows it.” 


so on. But artificial intelligence could take 
away as many jobsas it creates, for example. 
In my view, the concept of the fourth indus- 
trial revolution is too vague to pin taxpayers 
money on. 

Furthermore, these three policy drives 
will squeeze budgets in other research 
areas. There is no clear idea which areas 
those should be. 


OPEN DIALOGUE 

I don't have all the answers. But the first 
thing that needs to happen is a frank and 
honest debate. Whatever policy we want 
to make or whichever problem we want to 
solve, openness and rigour will reduce the 
possibility of major errors. 

I would like to see more scientists get- 
ting involved in policymaking in South 
Korea. I will seek to establish a culture that 
makes it routine for the government and the 
scientific community to hold regular, trans- 
parent, evidence-based discussions. As a 
first step, the Presidential Advisory Coun- 
cil on Science and Technology will look at 
South Korean R&D directions and at ways 
to strengthen its research institutes. 

The roles of government and indus- 
try need to be reset in light of the rapidly 
changing global economy. Sweden, for 
example, has come under fire in the past 
for its high R&D spending and low output’. 
The rate of innovation is slowing world- 
wide’, and each advance is becoming more 
expensive. For example, with silicon tech- 
nology approaching its fundamental limit, 
the cost of creating computer chips that are 
ever smaller has exploded. 

South Korean science has come a long 
way in the past 30 years. The challenges it 
faces present a great opportunity: it is time 
for South Korea to reposition itself to deliver 
the best science, innovations and solutions 
for its people. m 
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Improving health and well-being in Malaysia is a priority for the government. 


A road map for 
Malaysian science 


Plug the digital divide, improve health care, develop 
biotechnology industries and innovate for the 
trillion-dollar Halal economy, urges Asma Ismail. 


has the third-highest gross domes- 

tic product (GDP) per person of the 
ten countries in the Association of Southeast 
Asian Nations (ASEAN). It has achieved this 
by shifting from producing cheap goods such 
as tin, rubber, cocoa, timber and rice to more 
lucrative commodities including oil, natural 
gas and palm oil. Science and technology are 
central to the country’s economic growth 
strategy. For example, electronic components, 
petroleum products and petrochemicals, 
machinery and food are major exports. The 
government views biotechnology as a means 
of enhancing prosperity and wellness. 

But new paths must be forged. The econ- 
omy has slowed since the Asian financial 
crisis in 1997, as the advantages that have 
historically made Malaysia attractive for 
investment — pools of low-skilled labour, 


S= years after independence, Malaysia 
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good infrastructure and tax incentives — have 
diminished. The nation is vulnerable to com- 
petitors such as China that produce luxury 
goods, and countries with lower labour costs, 
such as Vietnam. However, as a majority 
Muslim country, Malaysia is well positioned 
to contribute scientific innovations to pre- 
mium products and services for the Halal 
economy. 

The nation must tackle three further 
emerging problems — health problems, 
rapid urbanization and climate change. 
Malaysia has the highest level of obesity in 
southeast Asia, with nearly half of adults clas- 
sified as overweight or obese. The average 
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age of the population is rising faster than 
for any other country in the world. By 2050, 
20% of Malaysians will be aged 60 or older, 
compared with 8% today’. By 2050, 9 out of 
10 Malaysians are expected to live in cities, 
putting pressure on resources. Malaysia is 
already on average 1 °C warmer than it was in 
1999; an increase of another 2°C is predicted 
by 2050 (go.nature.com/2hdrésj), along with 
a higher sea level and increased rainfall. 

Malaysia must build on its strengths to 
overcome these challenges. The country is 
one of the most biodiverse on Earth, with 
almost 19,000 species” of animal, plant, 
fungus and moss as well as ancient tropical 
rainforests. It has more than 400 vibrant insti- 
tutions of higher learning and untapped pools 
of talent, producers and consumers. 

Malaysia spends a small proportion of its 
GDP on research and development (R&D), 
allocating the sector 1.3% in 2015, compared 
with 2.5% on average for developed nations’. 
A lack of national coordination is hampering 
R&D. Few researchers work in industry, and 
too little is being spent on translating R&D 
outputs into products and services in compar- 
ison to developed economies, such as South 
Koreaand the United Kingdom’. Research is 
diffused across many institutions. 

Malaysia needs a new national-level 
strategic direction for science, technology 
and innovation to better focus its efforts. 
As president of the Academy of Sciences 
Malaysia (ASM), an independent body man- 
dated to be the country’s thought leader for 
science, technology and innovation, I out- 
line how the science and technology system 
can be re-engineered to help the economy 
and people. 


MANY BODIES 
The Malaysian science, technology and 
innovation landscape is extremely diverse. 
Science cuts across 23 federal ministries, and 
the country has 14 state governments and ter- 
ritories’. Overall, 268 entities are involved in 
implementing policies and initiatives. Cen- 
tral coordination and planning is needed to 
minimize duplicated efforts, and to overcome 
the fragmented resources and ineffective 
decision-making that dilute impacts. Of the 
total R&D budget, 70% (10.6 billion Malay- 
sian ringgits; US$2.6 billion) was spent on 
applied research and 21% went to basic 
research — proportions in line with devel- 
oped countries such as the United Kingdom 
and Singapore. But Malaysia falls short on 
investment in experimental development, 
spending just 9% on activities such as proto- 
typing, piloting and scaling-up. By contrast, 
South Korea, Japan and Ireland spend 62%, 
64% and 48% of their R&D budgets, respec- 
tively, on translating research into products 
and services’, 

Institutions of higher learning host 78% 
of Malaysian researchers; only 12% are in 
business enterprises and 10% are in the 
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government sector’. In South Korea, 80% of 
the country’s researchers were in businesses in 
2015. In Singapore and Thailand, it was 51%. 
This shows that Malaysia’s R&D is not 
industry-driven and the lack of high-skilled 
talent in businesses is limiting translation of 
research outputs. One reason science has not 
percolated well into Malaysian industry is that 
98.5% of businesses are small or medium- 
sized enterprises (SMEs). These SMEs lack 
skilled workers and money to invest in R&D. 
Three-quarters of the workforce is low- 
skilled, which can make integrating science a 
challenge. Large firms were more than three 
times more productive than SMEs in 2016, in 
terms of added value per worker, contributing 
63% to the nation’s GDP. SMEs should step 
up their productivity and contribute more to 
economic growth and social well-being. 

Networking and collaboration could 
enhance SME productivity in the short term. 
But in the long term, Malaysian businesses 
need to build up and engage more scientists. 
This will propel them from imitating and 
adapting to creating. 

Another problem is that companies rarely 
engage with communities who will use the 
technologies, especially the poorest. Few 
manufacturers want to produce products for 
markets that they perceive as unprofitable. 


PRIORITY AREAS 

In 2017, an ASM study on emerging 
Malaysian science and technology high- 
lighted five technology areas for investment: 
biotechnology, digital technology, green 
technology, nanotechnology and neurotech- 
nology (see “Technology game-changers’)'. 
These cut across disciplines and would poten- 
tially boost both industry and society. 

The Halal economy is another area in 
which Malaysia is a leader and could con- 
tribute more scientifically, by developing 
products, standards and tracking systems. 

Consumers spent US$2 trillion on Halal 
products and services globally in 2016, and 
1.7 billion consumers adhere to Halal prin- 
ciples such as avoiding animal products 
and alcohol in pharmaceutical ingredients’. 
These principles are of increasing interest 
beyond Islamic societies. The Halal market 
extends from food to finance and banking, 
insurance, education, health care and tour- 
ism. In 2016, the Halal industry accounted 
for 7.5% of Malaysia's GDP, with 1,400 local 
enterprises exporting products worth 
205 billion ringgits. The government's goal is 
that by 2020, the Halal industry should make 
up 8.7% of GDP. 

Malaysian scientists are well positioned to 
develop quality standards and certification 
systems for Halal products. A range of alter- 
native and substitute ingredients are needed 
for pharmaceuticals, cosmetic products and 
foods. Quality and safety must be built in. 
Food traceability is crucial to allow compa- 
nies to have confidence in supply chains and 


TECHNOLOGY GAME-CHANGERS 


Five priorities for Malaysian research 


Investment in the following areas would 
best support the nation’s growth’. 


Biotechnology. Agriculture has been 

a key sector for Malaysia’s economic 
growth. Precision farming is set to increase 
yield, efficiency and market reach, and 
sustainable agriculture needs to be 
developed. Biotechnology in health care is 
another priority. For example, transgenic 
plants and genetically engineered animals 
might produce pharmaceutical substances 
more cheaply than conventional methods. 


Digital technology. Malaysia has to tackle 

its digital divide, especially between rural 
and urban communities. Whereas 78% 

of households nationwide have Internet 
access, only 38% of rural households 

do®’. Improving broadband access would 
enable poor and rural communities to share 
information and create jobs and wealth. 
Real-time data analysis, through the Internet 
of Things and cloud computing for example, 
could inform decision-making across sectors. 


Green technology. Malaysia has pledged 
to reduce greenhouse-gas emissions by 


to isolate affected goods quickly should a 
problem arise. For example, geographic infor- 
mation systems (GIS) and radio-frequency 
identification trackers can monitor products 
being transported from farm to user. 

Malaysia is a leader in Islamic finance, 
having a large asset base, many institutions 
and a strong regulatory framework in this 
area. The country should encourage the 
development of alternative forms of finan- 
cial technology through cloud computing, 
big-data analytics, artificial intelligence 
and blockchain. 


PULL TOGETHER 

Malaysia wants to be a strong scientific 
nation. Successive governments have paved 
the way, seeing science and technology as the 
backbone of development. Now the country 
needs a lean and well-governed R&D system, 
spearheaded by a central agency and centred 
on people. 

The new government, elected in May, must 
start by setting up a central research manage- 
ment agency — similar to the US National 
Science Foundation — to co-ordinate, plan, 
monitor and evaluate the country’s science, 
technology and innovation. That agency 
could be complemented by a technology 
commercialization agency — for example, 
one resembling Innovate UK — to drive inno- 
vation and commercialization. 


an ambitious 45% by 2030 as part of 

the Paris Agreement. Renewable-energy 
technologies must be advanced to reduce 
dependency on fossil fuel. Technologies 
might include fuel cells, solar and ocean 
thermal-energy technologies. 


Nanotechnology. Real-time health 
monitoring using nanosensors and 
wearables would improve preventive 
health care. People could track activities, 
vital signs, medications and mental 

and physical states to facilitate early 
intervention. Affordable over-the-counter 
biomarker kits should be developed as 
diagnostic tools, because survival rates are 
higher for diseases such as cancer if they 
are caught early. 


Neurotechnology. Neurological diseases, 
such as Alzheimer’s and Parkinson’s, are 
on the rise and are expected to become 
the second leading cause of morbidity and 
mortality in Malaysia, after heart disease. 
Incidences are expected to rise in the 

next decade, owing to longer lifespans 

and lifestyle and environmental changes. 
Neurogenerative treatments are needed. 


More funding will also be needed to 
promote the later stages of product develop- 
ment. The research and commercialization 
agencies must forge links among industry, 
academia and communities. 

Developing international collabora- 
tions will be important to share Malaysian 
strengths and values, to exchange knowledge 
and to spur innovation. 

To bea progressive nation, Malaysia must 
pursue economic growth, human develop- 
ment and societal well-being. m 


Asma Ismail is the president of the Academy 
of Sciences Malaysia, Kuala Lumpur. 
e-mail: asmainformm@yahoo.com 
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The Form and Function room at the Steinhardt Museum of Natural History in Tel Aviv. 
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Israel’s wild treasury 


Josie Glausiusz tours Tel Aviv’s stunning new museum of natural history. 


glass case crammed with a glorious 
A umble of stuffed animals and birds 

greets me as I enter the atmospheric 
Treasures of Biodiversity gallery at the new 
Steinhardt Museum of Natural History in 
Tel Aviv, Israel. This cabinet of curiosities 
contains century-old specimens collected 
by German ornithologist Ernst Schmitz. In 
1908, Schmitz travelled to Jerusalem — then 
part of the Ottoman Empire — and opened 
a natural history museum showcasing a 
spectacular collection of rare mammals, 
butterflies and other insects, reptiles and 
birds’ eggs, many gathered by local Bedouin 
people. These riches are now on display 
among the museums vast holdings. 

“This is the last Nile crocodile in Israel,” 
says Tel Aviv University zoologist Tamar 
Dayan, chair of the museum and the 
driving force behind its creation. “This 
is the last cheetah in Israel. The leopard 
comes from the Judean Mountains, from 


Steinhardt Museum of Natural History 
Tel Aviv, Israel. 
Preliminary opening from 2 July. 


a population that no longer exists. This is 
the last oryx in Israel, and the last bearded 
vulture. We have a lot of ‘lasts’ here.” The 
glass case encapsulates the loss of many 
species — a full third of the large mam- 
mals in what is now Israel. Conveying the 
magnitude of this change is a key part of 
the museum’s mission. 

In this, it succeeds. Much of the exhibi- 
tion space is inevitably allocated to dis- 
playing some of its 5.5 million skeletons 
and other specimens of flora and fauna, 
previously dispersed over the campus of 
the adjacent Tel Aviv University. But the 
causes of biodiversity loss in Israel are 
also front and centre. Charts in Hebrew, 
English and Arabic deliver the facts on cli- 
mate change, habitat destruction, invasive 
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species, pollution and over-exploitation of 
marine resources. A 6-metre-long inter- 
active map of the region enables visitors 
to witness a century and a half of environ- 
mental impacts. For example, over-use of 
water has indirectly shrunk the Dead Sea, 
and unchecked construction has destroyed 
some two-thirds of Israel’s coastal sand 
dunes since the country was established in 
1948. It points, too, to the impacts of inva- 
sive species, pollution and hunting. 

Some two decades in the making, the 
museum itself is an architectural jewel, 
designed by Kimmel Eshkolot Architects in 
Tel Aviv to resemble a treasure chest. The 
light-filled building — partly surrounded by 
an ark-like facade of wood veneer — can be 
navigated entirely on ramps and walkways, 
and is fully accessible for wheelchairs and 
prams. As at the Darwin Centre in London's 
Natural History Museum, visitors can gaze 
through windows beyond the taxidermy 


ITAL BENIT 


to watch scientists studying taxonomy and 
systematics, biogeography and ecology, 
invasion biology, evolution, zooarchaeol- 
ogy, archaeobotany and more. Five hundred 
researchers will work in the museum. 

The ramp system drew some criticism at 
the design stage from people concerned that 
it would encourage children to run around. 
Architect Michal Kimmel Eshkolot thinks 
that this is a good thing: public buildings, 
she tells me, “should be friendly to people, 
“This is the inviting, the oppo 


: site of intimidating” 
last oryx in Children are 
Israel, and the just as likely to be 
last bearded mesmerized by 
vulture. We have the exhibits. The 
alot of ‘lasts’ Form and Function 
here.” room is filled with 


artfully mounted 
animals — including a caracal leaping 
to capture a ground-dwelling bird called 
a black francolin— and the skeletons of 
native and non-native mammals and birds. 
A replica velociraptor hangs from the ceiling 
(although, as Dayan admits, the only traces 
of dinosaurs found in Israel are footprints). 
Bugs and Beyond celebrates entomology, 
including cases filled with live Madagascar 
hissing cockroaches, giant stick insects and 
pinned tarantulas and butterflies of all col- 
ours and sizes. Kids can also watch a video of 
a dead rodent decomposing in half a minute, 
its fur and bones replaced by grass and a 
graceful toadstool. 

Creative use of space is integral. Six shal- 
low dioramas showcase the mammals, birds 
and insects found in six Israeli habitats: 
desert, sandy, aquatic, Mediterranean wood- 
land and scrubland, the Hermon mountains 
and non-native, fire-prone pine plantations. 
A cast of a giant minke whale skeleton (in 
whose bones pigeons nest) hangs in an 
alcove open to the sky. 

The museum's most dramatic display is 
also airborne. The Great Bird Migration 
is a swirl of avian travellers soaring high 
above our heads in the great entrance hall. 
Among them are the common crane, grey 
heron, greater flamingo, osprey, white stork, 
great white pelican and black kite. They 
represent some of the 500 million birds that 
fly over Israel in spring and autumn. Many 
are listed among the winged creatures that 
the Hebrew Scriptures’ book of Leviticus 
prohibits the Israelites from eating. It’s a 
reminder, perhaps, that these birds have 
been present in the region for thousands of 
years and probably much longer. And that, 
if we do not protect them, this is how they 
will end up: stuffed. m 


Josie Glausiusz writes about science and 
the environment for magazines including 
National Geographic, Scientific American 
and Hakai. 

Twitter: @josiegz 
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Books in brief 


Lessons from the Lobster: Eve Marder’s Work in Neuroscience 


Charlotte Nassim MIT PRESS (2018) 

For 40 years, neuroscientist Eve Marder has researched a tiny clutch 
of specialized neurons controlling the crustacean stomach — the 

Prom Te LOBSTER stomatogastric ganglion. From that intense, data-centred process, 


EVE MaRDe> 
NEUROSCiE yet ORK IN 


she has gleaned key findings on the operation of neuronal circuits, 
neuronal homeostasis and neuroplasticity. Charlotte Nassim’s 
richly detailed ‘thought biography’ unpeels the minutiae of lab life, 
revealing how Marder, “without technological fireworks or lavish 
funding”, has illuminated areas of human neuroscience such as 
brain variability. A nuanced portrait of an inspired scientist at work. 


Aroused 

Randi Hutter Epstein W. W. NORTON (2018) 

Hormones may be ringmasters of the bodily circus, controlling 
everything from sex to metabolic function, but in this invigorating 
history they become stars of the show. Medical journalist Randi 
Hutter Epstein navigates endocrinology’s messy evolution through 
players such as neurosurgeon Harvey Cushing, who indefatigably 
researched the pituitary ‘master gland’, and driven Nobel laureate 
Rosalyn Yalow, who co-invented radioimmunoassay. Here, too, is the 
wilder side, from a testicle-swapping experiment on roosters to the 
animal-ovary elixirs once prescribed for menopausal women. 


The Art of Logic 

Eugenia Cheng PROFILE (2018) 

Nothing in the world, notes mathematician Eugenia Cheng, behaves 
according to logic. Yet, in an era awash with conflict, exploitation, 
tribalism and fake news, the “illuminating precision” offered by logic 
is important. Cheng harnesses the power of abstraction to explore 
real-life phenomena such as sexism and white privilege. She walks 
us through the grand terrain of logic, from axioms to proofs. And she 
reveals how to build arguments as long chains of logical implications 
—a “virtuosic and masterful” skill that, combined with intelligent 
emotional engagement, can cut through pervasive irrationality. 


Rising 

Elizabeth Rush MILKWEED EDITIONS (2018) 

This evocative exercise in lyrical reportage by Elizabeth Rush tracks 
sea-level rise in the here and now, by way of the disintegrating shores 
and salinated soils of the coastal United States. Rush journeys from 
the low-lying Isle de Jean Charles off Louisiana to Maine, Florida, 
New York and beyond, gathering stories from field biologists, climate 
scientists and beleaguered citizens as she goes. She touches, too, on 
the ten successive Atlantic storms that became hurricanes in 2017, 
from Franklin to Ophelia. At once a powerful group portrait of lives 
and communities on the brink, and a lament for lost habitats. 


Twelve Tomorrows 

Wade Roush (ed.) MIT PRESS (2018) 

This MIT Technology Review anthology is a science-fictional 
exploration of emergent technologies and a veritable constellation 
of brilliant writers, among them Liu Cixin, Ken Liu, Alastair Reynolds, 
Elizabeth Bear and Sarah Pinsker. Ken Liu is on masterful form 

in ‘Byzantine Empathy’, a visceral narrative shaped around 
cryptocurrency; Pinsker’s ‘Escape from Caring Seasons’ plays with 
wrist chips and drone armies; and Bear’s ‘Okay, Glory’ features a 
smart home turned kidnapper. A profile of esteemed sci-fi author 
Samuel R. Delany is included. ‘Hard’ sci-fi at its best. Barbara Kiser 
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Alan Bean at an exhibition of his work at the National Air and Space Museum in Washington DC in 2009. 


The artist who walked 
on the Moon: Alan Bean 


Richard Taylor pays tribute to the Apollo astronaut who 
beautifully meshed science and art. 


old, my father pointed to the Moon and 

told me that a man was walking on it. I 
looked up and wondered what he was doing 
in that remote, crater-riddled land. I later 
learned that his name was Alan Bean, and that 
he was the fourth of only 12 humans so far 
to walk on another world. Even in that select 
group, he was unique: he was the only one to 
record what he saw on canvas and in paint. In 
May, he died at the age of 86. 

As my interest in space travel grew, I read 
about what led Bean to his Apollo 12 Moon 
landing. Earning an aeronautical-engineer- 
ing degree from the University of Texas at 
Austin in 1955, he soon achieved his child- 
hood dream of becoming a Navy test pilot. 
His instructor was Pete Conrad, later a fel- 
low member of the Apollo 12 mission and 
Moon-walker, who became his closest friend. 
Inspired by the “sights, sounds and smells of 
high performance flying machines’, as Bean 
put it, they hatched their plan to ride the big- 
gest flying machine of them all. 

Standing 110 metres tall, the Saturn V 
remains the most powerful rocket ever flown. 
Four months before the Apollo 12 launch, 
one of these behemoths had carried Neil 
Armstrong and his crew to the first Moon 
landing. But whereas Armstrong took off 
ona sweltering summer's day, Bean, Conrad 


I: November 1969, when I was six years 
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and fellow astronaut Richard Gordon sat on 
their rocket engulfed by a winter thunder- 
storm. Thirty-six seconds into their launch, 
the unthinkable happened. The Saturn V was 
struck by lightning — twice. “I looked up at 
the display that had all of the caution lights 
and there were more on than Id ever seen in 
my life,” Bean recalled. Seconds away from 
aborting the mission, he managed to reboot 
the affected systems. The astronauts’ nervous 
laughter could be heard all the way to orbit. 

Previous astronauts behaved with reserve. 
Bean gave a glimpse of the more human side 
of being a space explorer. Armstrong com- 
menced his historic landing with a deadpan 
“See you later”, descending to the Moon's 
surface in tension-building silence. Bean 
sounded like an excited tourist. His commen- 
tary seemed to touch on whatever popped 
into his mind: from the view (“Looks good 
out there, babe, looks good”) to the relief of 
seeing his landing spot in the Ocean of Storms 
(“There’s that crater right where it’s supposed 
to be”), to complementing Conrad on his fly- 
ing skills (“Youre beautiful”). 

Once Bean had ignited my six-year-old 
imagination, I was on my way to the life sci- 
entific. I drew make-believe planets, the real 
Solar System, spaceships and alien cities — 
even how aliens might play cricket without 
gravity. By 1984, close to finishing my physics 


degree, I was — like him — grappling with 
competing desires to pursue science and art. 

Meanwhile, the world was celebrating the 
15th anniversary of the first Moon landing. 
Television screens were flooded with Apollo 
astronauts reminiscing. Seeing the blue Earth 
hanging like an oasis in the inky darkness 
filled many of them with a deep spiritual con- 
nection to the Universe. Bean, more down-to- 
Earth, appreciated all that the Moon lacked. 
“Since that time I have not complained about 
the weather a single time... ’ve not com- 
plained about traffic; he said. “When I got 
back home, Id go down to shopping centres 
and ... just watch the people go by and think, 
‘Boy, we're lucky to be here” 

Bean's scientific legacy is fascinating. He 
brought back a Moon rock known as KREEP 
(potassium, rare-earth elements, phospho- 
rus). Its composition led to a new model of 
lunar formation: the giant-impact hypoth- 
esis. Still being refined by research, this pic- 
tures the Moon forming in collisions between 
Earth and one or more planet-sized objects. 

Bean flew once more for NASA, in 1973: 
he spent a record-breaking 59 days orbiting 
Earth as commander of the space-station mis- 
sion Skylab 3. In 1981, he left the agency to 
work out how best to tell his story. How could 
he describe what it was like to hurtle home 
at 40,000 kilometres per hour, or to place his 
thumb in front of Earth and block from view 
everything he knew? He found his answer in 
painting. He mixed Moon dust into his acryl- 
ics, and used his Apollo hammer and boots to, 
in his words, “sculpta textured surface unique 
in all ofart history”. (Many of Bean’s works are 
reproduced in his 2009 book Painting Apollo: 
First Artist on Another World.) 

Bean’s art is important in other ways. 
Apollo 8 astronaut Bill Anders’s stunning 
photograph Earthrise, taken from lunar orbit, 
is iconic. But Bean's art goes further: it adds 
emotion to its extraordinary scenes. Self- 
described as one of the more fearful astro- 
nauts, he was aware that death was always 
near. That comes through in his paintings. 
Whether we see astronauts deploying equip- 
ment, the Service Module flying across the 
lunar surface, or Earth peeking above the 
horizon, there’s a feeling of being far from 
home — in both distance and difficulty. The 
loneliness in the works reminds me of Frank 
Hurley’s photographs of Ernest Shackleton’s 
epic 1914-17 journey to the Antarctic. 

Above all, Bean’s paintings serve as an 
antidote to that foolish idea that emerged 
in the 1980s: that our brains are wired to be 
either artistic or scientific. Inspired by his 
example, I went on to be a professor of both 
art and science. He showed that it was a sim- 
ple matter. You just follow your dreams. = 


Richard Taylor is professor of physics, 
psychology and art at the University of 
Oregon in Eugene. 

e-mail: rpt@uoregon.edu 
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One wolf shot 
is too many 


Denmark’ position as a haven 
for recolonizing top predators 
has been seriously undermined 
by the shooting in April of one of 
the first wolves to be born in the 
country in almost 200 years. 

The Eurasian wolf (Canis lupus 
lupus) reappeared in Denmark 
before 2012 after at least eight 
individuals immigrated from 
Germany and Poland (see 
go.nature.com/2mfaxxz). The 
subpopulation is protected under 
the European Union's Habitats 
Directive. 

The killing of the female wolf 
arose from controversy among 
hunters, farmers and politicians 
over the recolonization. It was 
widely publicized on social 
media. We call for greater 
political accountability and better 
management of the country’s wolf 
population: at a minimum, there 
should be protected denning 
areas, sufficient economic 
compensation for sheep farmers 
and satellite tracking of wolves. 

Without such measures, 
Denmark's role in setting 
the agenda for international 
agreements such as the United 
Nations Red List of Threatened 
Species could rightly be called 
into question. 

Christian Sonne Aarhus 
University, Roskilde, Denmark. 
Aage K. O. Alstrup Aarhus 
University, Aarhus, Denmark. 
cs@bios.au.dk 


Al threat to drug 
development 


Artificial intelligence could 
help to identify more-effective 
candidate drugs (see Nature 557, 
$55-S57; 2018). However, this 
dream held by patients, clinicians, 
physicians and the public-health 
system could become a nightmare 
for the pharmaceutical industry. 
The development ofa clinically 
active ingredient generally costs 
hundreds of millions of euros, 
so the compound needs to be 
protected by a worldwide patent 
for the process to be economically 


feasible. A patent is granted only 
when a compound's application 
can be classified as both ‘new 
and ‘invented. A highly effective 
compound thrown up by an AI 
algorithm could indeed be new. 
Whether it is ‘invented’, however, 
is debatable. This is because the 
inventor might be considered 

as either the algorithm (so not 

a person) or its programmer. 

It could be argued that if there 
is a connection between the 
program and the compound's 
structure, then it is predictable 
by experts and so no longer 
inventive. Or, ifthe programmer 
cant explain how the AI 
algorithm found the structure, 
then he or she didn't invent 
anything. Assigning and agreeing 
on intellectual-property rights 
will be even more complex when 
different parts of the process 
behind important discoveries 
involve multiple contributors. 

Patent law would presumably 
have to be adapted, for example by 
acknowledging the development 
work through a temporary ban on 
imitation. 

Lutz Heuer Dormagen, 
Germany. 
nievenheimer1@t-online.de 


Young scientists aim 
to prioritize patients 


Translational medicine helps to 
prevent scientific research from 
being wasted by focusing on the 
long-term benefits for patients. 
As early-career researchers, 

we want to accelerate that 
process — rather than waiting for 
senior researchers to spearhead 
the necessary changes. 

Our international network 
of aspiring clinician-scientists, 
called Apollo, collaborates 
with senior scientists from 
the EUREKA Institute for 
Translational Medicine in 
Syracuse, Italy. We aim to learn 
about the goals, opportunities 
and challenges of translational 
medicine, including finding 
a more-efficient way to use 
research funding and balancing 
the competing interests of the 
different parties involved in 


research. We organize meetings, 
student workshops, mentorship 
programmes and an annual 
training programme. 

We hope to develop the skills to 
navigate and improve the drug- 
discovery pipeline. Speeding up 
research delivery from bench to 
bedside will also entail changing 
how scientists are evaluated, as 
well as promoting collaboration 
across disciplines (R. Benedictus 
et al. Nature 538, 453-455; 2016). 
Remi Stevelink*, Gautam Kok* 
Apollo Society, University Medical 
Center Utrecht, the Netherlands. 
info@apollosociety.eu 
*On behalf of the boards of Apollo 
Utrecht and Toronto; see www. 
apollosociety.eu for details. 


Cite the online date 
of publication 


With online delivery increasingly 
dominating scientific publishing, 
most long-established journals 
run papers in both print and 
online formats — but not 
necessarily simultaneously. 

This can affect how researchers 
are given scientific priority. 

In our view, scholars from all 
disciplines should use the earlier, 
online citation date, rather 

than defaulting to print as the 
traditional record. 

In our experience, the time lag 
between the two can be as long as 
6 months. This might be crucial 
for annual research evaluations, 
for instance, when a paper is 
published online at the end of one 
year and in print the year after 
(see, for example, A. L. Woerman 
et al. Acta Neuropathol. 135, 
49-63; 2018, originally published 
online in August 2017). Timings 
are also key when two competing 
groups publish papers that report 
similar findings. Young scientists, 
in particular, need to have the 
publication date of their original 
work accurately recorded. 

Using the online publication 
date as the primary citation 
would dispel such confusion. 
Reference lists that include dates 
for both publication formats and 
are picked up by public databases 
such as PubMed area step in the 


right direction. Editors, authors 
and indexers need to work 
together to manage any effect on 
priority when publishing a paper 
in both formats. 

Michael A. Keller Stanford 
University, California, USA. 
Stanley Prusiner Institute for 
Neurodegenerative Diseases, 
University of California, San 
Francisco, California, USA. 
michael. keller@stanford.edu 


Scavengers need 
help from IPBES 


Scavenger conservation is in 
jeopardy, which risks amplifying 
current threats to biodiversity 
and ecosystem services. In our 
view, ill-informed policies for 
managing carrion left over from 
farming, hunting and fisheries 
are largely to blame. We call for 
the Intergovernmental Science- 
Policy Platform on Biodiversity 
and Ecosystem Services (IPBES) 
to help integrate such policies 
more effectively with their 
scientific implications as it draws 
up its 2020-30 programme (see 
Nature http://doi.org/cq8r; 2018). 
Scavengers are in decline 
worldwide. Vulture populations 
in Asia crashed after consuming 
livestock carcasses that contained 
the anti-inflammatory drug 
diclofenac (J. L. Oaks et al. Nature 
427, 630-633; 2004). Elephant 
and rhino carcasses poisoned by 
poachers are decimating African 
vultures and apex predators 
(D. Ogada et al. Conserv. Lett. 
9, 89-97; 2016). And changes 
in how discards from European 
fisheries are managed could 
adversely affect endangered 
seabirds (A. W. J. Bicknell et al. J. 
Appl. Ecol. 50, 649-658; 2013). 
The declines stand to 
compromise nutrient recycling, 
disease control and waste 
disposal, all of which contribute 
to public health and food security. 
Patricia Mateo-Tomas Research 
Unit of Biodiversity, Oviedo 
University, Mieres, Spain. 
Pedro P. Olea Department of 
Ecology, Universidad Autonoma 
de Madrid, Spain. 
rktespejos@gmail.com 
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NEWS & VIEWS 


Rapid formation of Mars 


An analysis of meteoritic material from Mars provides an accurate timeline of the planet’s early history. The results have 
major implications for our understanding of the processes involved in rocky -planet formation. SEE LETTER P.586 


LINDA T. ELKINS-TANTON 


uring their formation, many rocky 
D planets go through a phase known as 

amagma ocean, during which they are 
mostly or completely molten. Many research- 
ers thought that the solidification of Mars’s 
magma ocean was protracted, perhaps last- 
ing for up to 100 million years (Myr) after the 
ocean’s formation’. But on page 586, Bouvier 
et al.’ show that this process was completed in 
less than 10 Myr. The finding suggests that 
habitable conditions existed on Mars up to 
100 Myr before they did on Earth. 

In the contest between scientific models, 
empirical evidence is the arbiter. More than 
100 meteorites that originated on Mars have 
been identified on Earth, providing samples of 
the Martian crust. And advances in the sensi- 
tivity of instruments that measure the concen- 
trations of individual isotopes allow the ages 
of these materials to be determined with high 
precision. 

Bouvier and colleagues looked for minerals 
known as zircons in Martian meteoritic 
material. When a zircon crystallizes from its 
parent magma, its crystal structure allows stray 
uranium atoms to be trapped in the growing 
crystal, but rejects lead atoms. Consequently, 
when researchers study these minerals billions 
of years later, they can be confident that any 
lead in the crystals was produced by uranium 
decay and that no other sources of lead need 


a_ Less than 10 Myr 


Magma ocean 


Figure 1 | The evolution of Mars. Bouvier et al.’ analysed minerals 
known as zircons in meteoritic material from Mars, and determined 

a timeline for the planet's early history. The numbers represent the 
approximate time, in millions of years (Myr), since the formation of 
the first solids in the gas disk around the young Sun. a, After Mars had 
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b_ Less than 20 Myr 


Solidified magma 


to be considered. Furthermore, two uranium- 
lead decay processes (”°U to *”’Pb and **U to 
*°°Pb) can be used simultaneously to improve 
the precision of the results. Uranium-lead geo- 
chronology using zircons therefore yields the 
most precise ages of ancient geological materi- 
als that are currently possible. 

The authors analysed seven hard-earned 
zircons and obtained ages ranging from 
4,476 to 4,430 Myr. For comparison, the first 
solids in the gas disk around the growing 
young Sun, known as calcium-aluminium- 
rich inclusions (CAIs), formed 4,567.3 Myr 
ago’. Therefore, in the astonishingly short 
interval of 90 Myr, Mars grew from dust to a 
planet, solidified from its initial magma-ocean 
state and formed a crust containing zircons. 

This result already shows that models pre- 
dicting a protracted magma-ocean stage on 
Mars'~ cannot be correct, but Bouvier and co- 
workers’ study yielded even finer constraints. 
The lutetium-hafnium decay process, Tuto 
“HE can be used to constrain the melting his- 
tory of the zircons’ parent magmas, because 
the two isotopes behave differently during 
melting. The authors found that the zircons 
have unusually low concentrations of '”°Hf. 
This indicates that the parent magmas had 
lower amounts of ‘Lu than would be expected 
if they originated from the solidified products 
of Mars’s magma ocean. To form these parent 
magmas, the planet must have partially melted 
after it had solidified. 


ocean 


crust. 


Bouvier and colleagues’ findings provide a 
revised timeline for the early stages of Mars's 
history (Fig. 1). The planet grew to approxi- 
mately its current size within less than 10 Myr 
(and probably less than 5 Myr) of the forma- 
tion of CAIs, It then took less than 10 Myr to 
solidify from its initial magma-ocean phase. 
To put these timescales into perspective, ifthe 
Solar System were one day old, Mars would 
have fully formed in the first 6 minutes. About 
20 Myr after the formation of CAIs, the planet 
partially melted to produce magmas that rose to 
the planet’s surface; 70 Myr later, these magmas 
had solidified to form a zircon-containing crust. 

The rapid solidification of Mars’s magma 
ocean has important implications for our 
understanding of both Mars and the planet’s 
formation of rocky planets in general. The 
speediness suggests that heat was easily lost 
from Mars, which implies that the planet’s 
atmosphere was relatively thin. Two processes 
could have produced such an atmosphere: a 
low release of volatile gases from the magma 
ocean; and a stripping of the atmosphere by the 
young Sun*’. Researchers can now constrain 
the extent to which such processes occur much 
more closely, and can apply the results to the 
young Earth. 

The early growth and magma-ocean phase 
of Mars, and, by extension, of other planetary 
embryos, means that at least some of the 
planet’s formation probably happened while 
the gas disk was still present around the young 


Zircon-containing 
crust 


grown to approximately its current size, it existed in a magma-ocean phase, 
in which it was mostly or completely molten. b, The magma ocean solidified. 
c, The planet partially melted to produce magmas that rose (white arrows) to 
the planet’s surface. d, These magmas solidified to form a zircon-containing 
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Sun — on average, such disks exist for only a few 
million years””. Therefore, there is strong reason 
to think that gas in the disk would have diffused 
into the magma oceans on these embryos. 

This diffusion process could help to answer 
some long-standing questions about, for 
example, the noble-gas content of Earth. 
Today, Earth releases noble gases that must 
have been implanted in the mantle at the time 
of the planet's formation. The origin of these 
gases has been unclear because the rocky 
material that built Earth contained only a small 
quantity of noble gases. The diffusion of noble 
gases from the gas disk directly into the magma 
ocean might solve the mystery. 

Finally, Bouvier and co-workers’ timeline 
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allows the early histories of Earth and Mars 
to be compared directly. About 100 Myr after 
the formation of CAIs, Earth went through a 
magma-ocean phase that is thought to have 
been initiated by the collision of the planet 
with a Mars-sized body — a collision that 
led to the formation of the Moon". Conse- 
quently, the authors’ results suggest that Mars 
had clement conditions, and was possibly even 
hospitable to the formation of life, for as long 
as 100 Myr before such conditions existed on 
Earth. Mars had a head start on Earth in the 
planetary-evolution game. m 


Linda T. Elkins-Tanton is at the School of 
Earth and Space Exploration, Arizona State 


Cancer drug tackles 
overgrowth syndrome 


Abnormal activity of the enzyme PI3K can drive cancer growth, and mutations in 
a PI3K subunit can sometimes lead to non-cancerous overgrowth. A cancer drug 
that inhibits PI3K dramatically reduces such overgrowth. SEE ARTICLE P.540 


ROBERT K. SEMPLE 
& BART VANHAESEBROECK 


esearchers who investigate rare genetic 
R conditions live in hope that the discov- 

ery of disease-causing mutations will 
lead swiftly to tailored treatments. Sadly, this 
is not often the case, because genetic defects 
usually cause impairments that are difficult 
or impossible to tackle using available medi- 
cines. In this issue, Venot et al.’ (page 540) now 
provide a rare exception to this rule. In severe 
non-cancerous overgrowth syndromes caused 
by mutations in an enzyme called PI3K, they 
show the beneficial effects of a PI3K-inhibi- 
tor drug that was initially developed to treat 
cancer. Their results bring the possibility of a 
transformative therapy for people with over- 
growth conditions one step closer. 

The development of humans, from a single 
fertilized egg to an adult body that contains 
around 37 trillion cells” while maintaining 
symmetrical, paired body parts, is an astonish- 
ing feat that requires the lifelong coordination 
of cell division, survival and death. Growth- 
factor proteins can aid cellular coordination 
by acting on cell-surface receptors to stimulate 
intracellular signalling networks. These net- 
works often include PI3K, which is essential 
for the regulation of growth and development 
by insulin and insulin-like growth-factor 
hormones. 

Cancer arises from a flagrant breach of the 
rules of good cellular citizenship that are essen- 
tial in multicellular organisms, and cancer cells 


acquire genetic abnormalities that subvert the 
checks and balances that constrain cell growth 
and migration. Mutations that activate PI3K 
signalling — mainly those in the gene PIK3CA, 
which encodes p110a, a catalytic subunit of 
PI3K — are among the most common muta- 
tions to drive solid cancers*. Such signalling can 


PI3K 


inhibitor 
> 


PIK3CA mutations 
arise in a mosaic 
pattern in the 
overgrown limb region 
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also be activated by mutations that inactivate 
the enzyme PTEN, which normally keeps 
PI3K activity in check. The link between over- 
active PI3K signalling and cancer motivated 
researchers to develop compounds known as 
PI3K inhibitors. However, the clinical impact of 
these drugs on cancer has been less impressive 
than hoped because of toxicity associated with 
high doses. And even when such drugs succeed 
in inhibiting activated PI3K, other proteins can 
compensate to provide alternative pathways 
that promote cancer”. 

In 2012, certain PIK3CA mutations, which 
had previously been linked to cancer, were 
reported to cause rare, non-cancerous forms 
of overgrowth in people*’. A hallmark of these 
overgrowth syndromes is abnormal, excessive 
tissue growth that affects the body in a patchy 
and asymmetrical manner. This overgrowth 
is caused by PIK3CA mutations that occur 
after the start of embryonic development and 


PI3K 
inhibitor 


Figure 1 | People who have an overgrowth syndrome respond to treatment with a cancer drug. 
Venot et al.’ investigated a syndrome linked to abnormal activation of the enzyme PI3K, which can 
result in the non-cancerous overgrowth of a variety of tissues. The authors tested whether a low dose of 
a PI3K inhibitor called alpelisib, developed previously as a cancer therapy, could treat people who have 
mutations in the gene PIK3CA, which encodes the catalytic subunit of PI3K. In overgrowth syndromes, 


5,6,8 + 


these PIK3CA mutations arise in a mosaic patchwork pattern” in the region of the affected tissue (pink). 
Alpelisib treatment caused substantial improvements in the 19 recipients. Two examples of the decrease 
in overgrown tissue in patients after six months of drug treatment are shown. 
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only in some cells”””, which leads to cellular 
overgrowth in a mosaic-like pattern. The 
severity of the condition varies from person 
to person, and ranges from an isolated skin 
growth to a complex multisystem disorder 
called CLOVES syndrome’, which comprises 
considerable and often widespread overgrowth 
that contains an abundance of fat cells and 
abnormal blood vessels. PIK3CA mutations 
are a common feature of many overgrowth 
syndromes and the term PROS (for PIK3CA- 
related overgrowth spectrum)’ is used as a 
unifying description of such cases. 

PROS disorders do not seem to be linked 
to an increase in the risk of forming the solid 
cancers in which PIK3CA mutations are most 
prevalent’. Although the reason for this is 
unclear, the PIK3CA mutations associated 
with such disorders usually occur in cell 
types of a different embryonic origin from 
those that develop cancer linked to PIK3CA 
mutations’. 

Severe PROS disorders can be debilitating 
or even life-threatening. Overgrown tissue 
causes compression that can lead to vascular 
problems or organ dysfunction. Treatments 
aim to reduce excess tissue by surgery or by the 
physical blockade of enlarged blood vessels, 
and other therapy options are needed urgently. 
The availability of targeted inhibitors of p110a 
that had already undergone clinical testing as 
treatments for cancer gave researchers hope 
that these might offer a new therapy for PROS 
disorders. Yet questions arise about the effect 
of prolonged patient exposure to these drugs. 
Would this cause side effects? Would their 
cells adapt to dull the effect of such treatment, 
as occurs in cancer*? And would the over- 
growth be amenable to reversal by drug-based 
therapy? 

Venot et al.’ take an important step towards 
addressing these questions. Previous attempts 
to model PROS disorders in mice engineered 
to express Pik3ca containing disease-causing 
mutations produced excess growth in only 
some of the expected tissues’®. Venot and 
colleagues engineered another mouse model 
of a PROS disorder, in which an artificial 
system was used to make the mice express 
constitutively active p110a in all tissues. These 
animals developed problems similar to those 
in people with PROS conditions, including 
the overgrowth of adipose, muscle and vascu- 
lar tissue, and experienced a premature death 
caused by vascular complications. When the 
authors treated the mice with a PI3K inhibi- 
tor called alpelisib, an impressive, rapid and 
substantial decrease in the amount of over- 
grown tissue occurred, which prevented the 
premature death of the animals. 

Crucially, Venot et al. then assessed the 
effects of alpelisib in 19 people with PROS 
disorders who had severe or life-threatening 
complications. In adults, the team adminis- 
tered the lowest dose that had been tested in 
trials on people with cancer (250 milligrams 
a day), and in children they used a dose of 
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50 milligrams a day. Dramatic anatomical 
and functional improvements occurred in all 
patients across many types of affected organ 
(Fig. 1), with some benefits noted within days 
of the treatment starting. The study was not 
randomized, blinded or subject to placebo 
control, yet these striking initial results sug- 
gest that this outcome is likely to have clinical 
importance. Resistance to alpelisib was not 
observed, and the drug was well tolerated by 
the recipients. 

A predicted side effect of PI3K inhibition is 
a high blood glucose level, caused by interfer- 
ence with the PI3K-mediated metabolic effects 
of insulin. However, blood-glucose elevation 
occurred in only three people, in whom the 
elevation was modest. In children, the drug did 
not have an effect on normal growth, which 
suggests that overgrown tissue can be targeted 
without harmfully blocking PI3K-dependent 
childhood growth. Further systematic clinical 
studies are now needed, and ethics committees 
will have to assess whether it could be justified 
to include a placebo in trials on patients who 
are severely affected. 

The study of PI3K inhibition as a treatment 
for PROS disorders might also offer something 
in return towards the design of cancer thera- 
pies. The aim of PI3K-inhibitor therapy in 
PROS conditions would be to suppress disease- 
causing levels of PI3K signalling, while mini- 
mizing any side effects during the long-term, 
and probably lifelong, treatment. By contrast, 
the conventional approach of cancer therapy 
involves identifying differences between 
healthy and cancerous cells, and then hitting 
the cancer-specific characteristics as hard as 
possible to induce the death of cancer cells. 
In clinical trials for cancer, PI3K inhibitors 
are usually studied at the maximum tolerated 
dose. Yet whether this makes sense is unclear, 
given that the activity level of mutated p110a 
in cancer, which is low, is probably similar to 
the activity level of the same subunit in PROS 
disorders. Could a low dose of PI3K inhibi- 
tor be beneficial in treating a cancer linked 
to a PIK3CA mutation? This might abolish 
any PI3K activity that is above the usual level 
without completely blocking PI3K signalling, 
as would occur with a high dose of inhibitor. 
This could be tested, for example, as a strategy 
for preventing types of cancer in which PI3K 
activation or PTEN inactivation is an early 
event!’”’, or for preventing PTEN hamar- 
toma tumour syndrome in people who carry 
a mutation in PTEN and are therefore prone 
to cancer”. 

Low-dose PI3K inhibition might also be 
used as an option, following conventional 
treatments such as chemotherapy or surgery, 
for slowing the evolution of cancer and its 
adaptation to selective pressures”. And long- 
term, low-dose PI3K inhibition could offer 
further benefits — for example, it increases 
the metabolic health of obese mice and rhesus 
monkeys”, and sustained blockade of the PI3K 
pathway can slow ageing in animal models”®. 
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Perhaps it is time to target abnormal signal- 
ling in cancer with a lighter touch, which could 
enable the use of combination therapies that 
are currently precluded for reasons of toxicity. 
After all, there is no need to use a hammer to 
killa fly, and this principle might also apply to 
treating cancer. = 
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Orbital insight from an 
upright molecule 


A molecule standing on a metal surface has been found to emit electrons in 
the presence of an applied electric field. The emitted electrons produce an 
interference pattern reminiscent of a classic physics experiment. SEE LETTER P.573 


THOMAS GREBER 


uantum systems are described by 

wavefunctions, which have an ampli- 

tude and a phase: the square of the 
amplitude describes the probability of find- 
ing a particle in a given region of space-time, 
whereas the phase describes the sign (plus or 
minus) of the wavefunction. The ability to con- 
trol the phase of systems of electrons would 
open up opportunities for the development of 
quantum devices, and the first step in achiev- 
ing such control is to ascertain what the phase 
is in the first place. Unfortunately, the phase of 
an object's wavefunction is not directly observ- 
able. It is, however, possible to work out the rel- 
ative phase by observing interference patterns 
formed from the superposition (summation) 
of coherent electron waves (those between 
which there is a constant phase difference), by 
borrowing schemes from classic experiments 
that observed interference patterns in light, 
such as Thomas Young’s ‘double-slit’ experi- 
ment’ or Dennis Gabor’s demonstration of 
holography’. Writing in this issue (page 573), 
Esat et al.’ report a tabletop experiment that 
allows the phase of a molecular orbital to be 
determined from an interference pattern that 
arises as a result of electron emission from the 
molecule concerned. 

Esat and colleagues began their investigation 
by assembling a molecule on a silver surface, 
using the tip of a scanning tunnelling micro- 
scope at cryogenic temperatures (5 kelvin) to 
manipulate atoms and molecules with sub- 
nanometre precision. More specifically, they 
attached two silver atoms to one end ofa flat 
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Electron 
waves 


; oO 
Silver 


Figure 1 | A molecular electron emitter. Esat et al.’ 
prepared a metal-molecule complex that stands 
upright ona silver surface. When the authors 
applied a voltage between the surface and the tip 
ofa scanning tunnelling microscope, the molecule 
emitted electrons one at a time, producing electron 
waves. The spatial distribution of the resulting 
current contains patterns caused by interference 
between the electron waves. By analysing the 
patterns, the authors obtained information about 
the relative phase (plus or minus) of the region of 
the molecular orbital from which the electrons 
were emitted (the orbital is shown in blue and red; 
the colours represent the two relative phases). 
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pigment molecule known as 3,4,9,10-perylene- 
tetracarboxylic dianhydride (PTCDA) that 
was lying on the surface. They then lifted up 
the metal-molecule complex using the tip of 
their microscope, so that it stood upright on 
the surface (Fig. 1). 

The authors find that the complex is stable in 
this upright position — which might seem sur- 
prising to those in the know, because organic 
molecules preferentially lie flat on metallic sur- 
faces. It is not known which conformation of 
Esat and colleagues’ complex (flat or upright) 
is the more stable. However, their experimen- 
tal finding casts light on how molecules such 
as PTCDA can be stacked on metal surfaces, 
knowledge of which is essential for construct- 
ing nanoscale devices in which molecules are 
in electrical contact with metals. 

Erecting the molecule into this upright 
conformation allows it to perform a peculiar 
new function: it can emit electrons in the 
presence ofan electric field. When the authors 
positioned the microscope tip 7 nanometres 
above the standing molecule and applied 
a voltage of about 25 volts, they detected an 
electron current of 100 picoamps (1 pA is 
10° amps). Almost all of the electrons in the 
current pass across the sharp peak formed 
by the standing molecule. The electric field 
at the molecule’s apexes is much greater than 
it would be between flat electrodes, because 
it is enhanced by the curvature of the mol- 
ecule. Esat and co-workers show that the field 
enhancement is sufficiently high to allow 
electrons on the molecule to ‘tunnel’ into the 
surrounding vacuum, as measured in the field- 
emission current. 

The authors report that the electrons 
undergo a two-step tunnelling process to pass 
from the silver metal surface to the vacuum. 
First, a single electron tunnels from the surface 
into the lowest unoccupied molecular orbital 
(LUMO), where it adopts the orbital’s phase. 
In the second step, the electron is emitted at 
the edges of the molecule. The spatial distribu- 
tion of the emitted current contains patterns 
caused by the interference of each electron 
with itself. The existence of these features 
indicates that the emitted electrons ‘remem- 
ber the phase adopted from the part of the 
LUMO from which they were emitted — the 
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patterns wouldn't form unless the emissions 
had retained the orbital’s phase. 

Esat and colleagues’ experiment is remini- 
scent of Young’s double-slit experiment’, in 
which the patterns formed by the interference 
of light proved that light is a wave. But, in con- 
trast to Young’s experiment, the emission pat- 
terns observed by Esat et al. can be explained 
only if the electron wavefunction has a differ- 
ent sign depending on whether it is emitted 
from the top right or top left corners of the 
molecule. The relative phases of the electrons 
emitted from different sites of the molecule can 
thus be worked out from the spatial distribu- 
tion patterns of the emission current. 

The authors used an established method 
for moving atoms and molecules’ to produce 
their device. A complementary approach has 
previously been reported’ in which electrons 


are coherently emitted from carbon nanotubes. 
The physics underpinning the emission 
process is the same in both systems, but the 
approaches used to realize it are completely 
different: Esat and colleagues’ method can 
be thought of as a ‘bottom-up’ approach, in 
which the emitter is constructed from scratch, 
whereas the nanotube method was a ‘top- 
down’ approach in which nanotubes were 
painstakingly processed to allow the interfer- 
ence patterns to be observed and studied. The 
structures of Esat and colleagues’ emitters are 
therefore much more precisely defined and 
reproducible. 

The emission of electrons from a molecu- 
lar device could, in principle, be triggered and 
steered using a laser, as was recently demon- 
strated for larger emitters®. This would require 
the stability of the molecular emitters to be 


Weighing in on weight 
loss linked to cancer 


Weight loss and tissue wasting often occur in pancreatic cancer. Analyses of 
human and mouse data reveal a mechanism behind these events, and raise the 
question of whether tissue wasting affects cancer survival rates. SEE LETTER P.600 


J. MATTHIAS LOHR 


nearly symptom of pancreatic cancer 

is a profound loss in weight that can 

precede disease diagnosis by months’. 
Weight loss also occurs in other types of cancer, 
and is often associated with severe illness and 
a reduced quality of life. On page 600, Danai 
et al.” report an analysis of pancreatic cancer, 
using mouse models and clinical data, that 
illuminates the consequences of weight loss 
for cancer outcomes. 

Cachexia, the term used to describe the 
cancer-linked symptom of severe weight loss, 
has been recognized since at least the time of 
the ancient Greek physician Hippocrates. It is 
often a hallmark of cancers originating in the 
gut system’, and might manifest in changes 
such as loss of fat (adipose) tissue or skeletal- 
muscle wasting, which could arise if the body 
is using up the nutrient stores in such tissues. 
Cachexia is particularly common in people 
who have a type of cancer called pancreatic 
ductal adenocarcinoma. The mechanisms 
driving cachexia are not the same in all 
tumours’, but whether there are different types 
of cachexia depending on the tumour type or 
the stage of the cancer at which weight loss 
occurs remains to be determined. 

Two key mechanisms’ thought to drive 
cachexia are the breakdown of molecules in a 
process called catabolism, and inflammation, 
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which is controlled by the body’s immune sys- 
tem. The pancreas secretes digestive enzymes 
that break down complex, calorie-rich food 
to provide the components needed for tissue 
growth and maintenance’; this catabolism- 
supporting function is known as its exocrine 
role. Exocrine-system impairment causes 
malnutrition that can lead to life-threatening 
tissue wasting. However, the degree to which 
pancreatic exocrine-system abnormalities 
contribute to human cachexia was unknown. 
Human pancreatic cancer often occurs in 

a region of the organ 


pose = that can obstruct the 

This s triking main pancreatic duct, 
resultindicates hampering enzyme 
that cachexia release. This can lead 
does not to a situation termed 
drive cancer- pancreatic exocrine 
associated insufficiency, which 


results in nutrient- 
absorption deficien- 
cies and weight loss®. 
Cachexia in humans can be exacerbated if 
deficiencies occur in essential nutrients”, for 
example long-chain fatty acids and vitamin D, 
whose uptake is facilitated by pancreatic 
enzymes such as lipase. The administration 
of fatty acids increases skeletal-muscle mass 
in people with pancreatic cancer, particularly 
when this supplementation is combined with 
pancreatic enzymes’. 


mortality.” 


improved, but would be another step towards 
the development of phase control. Molecular 
emitters might eventually find applications in 
devices such as electron microscopes, detec- 
tors that identify the phase or spin of electrons, 
or even quantum computers. m 
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The research group that conducted the 
current study had previously’’ observed that 
tissue breakdown occurs before pancreatic- 
cancer diagnosis in humans and before the 
development of early-stage pancreatic can- 
cer in mice. To continue their investigation, 
Danai and colleagues studied pancreatic can- 
cer using genetically engineered mouse models 
of the condition'’”, and they used transplan- 
tation experiments to test whether tumour 
location affects wasting. They found that if 
pancreatic-tumour cells were transplanted 
into mice beneath the skin surface, adipose- 
tissue wasting did not occur, whereas wasting 
did occur if the cells were transplanted into 
the pancreas. This finding indicates that some 
aspect of the pancreatic environment has a 
key role in this phenomenon, and is consistent 
with the results of a previous study’ | However, 
that study also found that tissue wasting was 
promoted when tumour cells were introduced 
into the body cavity, suggesting that tumour 
presence at a non-pancreatic site can also 
trigger this phenomenon. 

Danai and colleagues’ metabolic investi- 
gations revealed that mice with pancreatic 
tumours used less oxygen and produced less 
carbon dioxide than did control mice lacking 
tumours. This suggested that the presence of 
the cancer might be linked to a decrease in the 
processes involved in food breakdown and 
nutrient adsorption. To investigate how the 
pancreatic-tumour environment might cause 
this early metabolic change and weight loss, 
the authors tested whether pancreatic exocrine 
insufficiency was responsible, given that this 
can occur in human pancreatic cancer®. When 
Danai and colleagues gave the mice pancreatic 
enzymes, the level of adipose-tissue wasting 
decreased, suggesting that pancreatic exo- 
crine insufficiency has a causal role in cachexia 
(Fig. 1). 

Danai and colleagues found that, although 
pancreatic-enzyme supplementation could 
limit the tissue wasting, the animals’ sur- 
vival rate did not improve. This striking 
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Figure 1 | Tissue wasting in pancreatic cancer. The pancreas secretes enzymes into the small intestine 
that aid the breakdown of food, enabling nutrients to be absorbed (black arrows). Abnormalities in this 
process can lead to weight loss and tissue wasting. Danai et al.” investigated the cause and consequences 
of the weight loss and adipose-tissue wasting that often occur early in pancreatic cancer. They observed 
that mice with pancreatic tumours had lower blood glucose levels, increased levels of adipose-tissue 
wasting and a decreased survival rate compared with control mice that did not have a pancreatic tumour. 
The authors tested whether feeding the mice pancreatic enzymes would result in any improvements, and 
found that it decreased adipose-tissue wasting and increased blood glucose. However, these changes did 
not increase the animals’ survival rate, providing insights into the debate about whether weight loss is 


linked to cancer mortality. 


result indicates that cachexia does not drive 
cancer-associated mortality. The result is also 
consistent with previous clinical evidence® 
that pancreatic-enzyme supplements do not 
improve survival in pancreatic cancer. More- 
over, when Danai et al. analysed clinical data 
to assess adipose-tissue wasting in 782 people 
with pancreatic cancer, they found that wast- 
ing did not correlate with poorer survival rates. 
However, it was previously reported” that the 
loss of skeletal muscle and adipose tissue is 
linked to worse cancer survival rates, so Danai 
and colleagues’ results call into question the 
idea that cachexia affects survival. 

As well as regulating exocrine function, the 
pancreas has endocrine functions — it pro- 
duces hormones that regulate metabolism. A 
key component of the endocrine system pro- 
duced by the pancreas is the hormone insulin. 
Insulin facilitates glucose uptake into cells, and 
its absence can cause diabetes. Diabetes can 
sometimes precede pancreatic-cancer diagno- 
sis by a year or two, and might be a red flag of 
trouble ahead“*. Moreover, abnormal glucose 
metabolism might contribute to adipose- and 
skeletal-tissue wasting’, and diabetes can 
cause exocrine insufficiency’®. 

Danai and colleagues observed lower insulin 
and glucose levels in the blood of their model 
mice compared with the levels in control 
mice, and this decrease in insulin and glucose 
might lead to increased breakdown of stored 
fats, which could, in turn, increase the level 
of tissue wasting. This potential connection 
between the endocrine and exocrine systems 
and weight loss is supported by studies in the 
fruit fly Drosophila melanogaster”. 

Much remains to be understood about the 


role of the exocrine and endocrine systems 
in pancreatic cancer. One way to address 
this might be to perform detailed gene- and 
protein-expression analyses to determine the 
signalling crosstalk between transplanted 
cancer cells and the surrounding healthy 
pancreas in the mouse model used by the 
authors. Another potential avenue of research 
would be to investigate pancreatic exocrine 
insufficiency at the time of cancer diagnosis, 
especially in people whose tumours do not 
block the main pancreatic duct. 

The authors did not investigate the role of 
inflammation in cancer-associated weight 
loss, but this is tricky to investigate because 
pancreatic tumours are associated with 
immunosuppression caused by factors such as 
the protein TGF-6. Inhibiting TGF-B reduced 
cachexia in a mouse model of pancreatic 
cancer’®, and there is circumstantial evidence 
that low-level inflammation contributes to 
pancreatic-exocrine insufficiency”. These 
observations provide tantalizing hints that 
inflammation warrants further investigation 
in this context. 

It is worth considering whether other 
mechanisms might contribute to cachexia. 
For example, appetite loss might in turn 
reduce enzyme output, so dietary intake could 
be another key factor. As work such as that of 
Danai and colleagues improves our under- 
standing of cachexia, the condition comes into 
focus as a distinct entity, rather than merely 
an early symptom of cancer. A goal for future 
research should be to delineate the interactions 
between exocrine and endocrine function and 
inflammation in cachexia. Although Danai 
and colleagues’ results cast doubt on whether 
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50 Years Ago 


The resignation of two matrons 
within a short space of time suggests 
that discontent among hospital 
staff is on the increase ... matrons 
simply do not wield today the 
power they used to. Together with 
senior nurses they are assuming 
more and more responsibility, but 
their opinions are not being taken 
into account. It would not be a gross 
exaggeration to say that the concept 
of an all-powerful, dictatorial 
matron is fast disappearing, and this 
is perhaps no bad thing, because no 
individual can successfully carry 
the burden of running a hospital. 
But the answer does not lie in the 
appointment of honorary members 
of hospital committees who may 

be highly capable managers and 
administrators in their own right, 
but who have no real knowledge 

of the problems of nursing staff. 
What seems to be happening is that 
these “amateurs” ... are overriding 
the people with professional 
knowledge. 

From Nature 29 June 1968 


100 Years Ago 


Every boy and girl at school who 
“does science” now learns that 
metric units are the universal 
medium of scientific expression, 
and is practised in their use ... 

A boy goes home at the end of term 
and tells his father that he has been 
doing science, weighing in grams, 
measuring lengths in centimetres, 
pressures in millimetres of mercury, 
and temperatures in degrees 
centigrade. Surely the most natural 
remark for any naturally minded 
parent to make is that his boy 

need not pay any attention to that, 
because, if it had any bearing at 

all upon practical life, he would 
certainly have been taught to use 
pounds or grains, inches, and 
Fahrenheit degrees, and not the 
outlandish things that nobody uses 
after he has left school. 

From Nature 27 June 1918 
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cachexia affects survival in cancer, if progress 
could be made to stop tissue wasting, it would 
substantially alleviate the disease burden for 
patients. = 


J. Matthias Lohr is in the Department of 
Cancer Medicine, Karolinska University 
Hospital, and in the Department of Clinical 
Intervention and Technology, Karolinska 
Institutet, 141 86 Stockholm, Sweden. 
e-mail: matthias.lohr@ki.se 


1. DiMagno, E. P. Ann. Oncol. 10 (Suppl. 4), 


Inretrospect 


$140-S142 (1999). 
. Danai, L. V. et al. Nature 558, 600-604 (2018). 
. Tisdale, M. J. Nature Rev. Cancer 2, 862-871 
(2002). 
4. Fearon, K., Arends, J. & Baracos, V. Nature Rev. Clin. 
Oncol. 10, 90-99 (2013). 
. Milton, K. J. Nutr. 133, 3886S-3892S (2003). 
. Vujasinovic, M., Valente, R., Del Chiaro, M., Permert, J. 
& Lohr, J. M. Nutrients 9, 183 (2017). 
7. Murphy, R.A. et al. Lipids 47, 363-369 (2012). 
8. Dev, R. et al. Oncologist 16, 1637-1641 (2011). 
9. Abe, K. et al. Anticancer Res. 38, 2369-2375 (2018). 
10.Mayers, J. R. et al. Nature Med. 20, 1193-1198 
(2014). 
11.Michaelis, K. A. et al. J. Cachexia Sarcopenia Muscle 
8, 824-838 (2017). 
12.Hingorani, S. R. et a/. Cancer Cell 4, 437-450 (2003). 


Wh 


Oo 


Twenty years of 
network science 


The idea that everyone in the world is connected to everyone else by just six degrees 
of separation was explained by the ‘small-world’ network model 20 years ago. 
What seemed to be a niche finding turned out to have huge consequences. 


ALESSANDRO VESPIGNANI 


the ‘small-world’ model of networks, 

which describes the clustering and short 
separations of nodes found in many real- 
life networks. I still vividly remember the 
discussion I had with fellow statistical physi- 
cists at the time: the model was seen as sort 
of interesting, but seemed to be merely an 
exotic departure from the regular, lattice-like 
network structures we were used to. But the 
more the paper was assimilated by scientists 
from different fields, the more it became clear 
that it had deep implications for our under- 
standing of dynamic behaviour and phase 
transitions in real-world phenomena ranging 


E 1998, Watts and Strogatz! introduced 


from contagion processes to information 
diffusion. It soon became apparent that the 
paper had ushered in a new era of research that 
would lead to the establishment of network 
science as a multidisciplinary field. 

Before Watts and Strogatz published their 
paper, the archetypical network-generation 
algorithms were based on construction 
processes such as those described by the 
Erdés—Rényi model’. These processes are 
characterized by a lack of knowledge of the 
principles that guide the creation of connec- 
tions (edges) between nodes in networks, 
and make the simple assumption that pairs 
of nodes can be connected at random with a 
given connection probability. Such a process 
generates random networks, in which the 


& 


Figure 1 | The small-world network model. In 1998, Watts and Strogatz! described a model that 
helps to explain the structures of networks in the real world. a, They started with a regular network, 
depicted here as nodes connected in a triangular lattice in which each node is connected to six other 
nodes. b, They then allowed links between nodes to be rewired at random, with a fixed probability 
of rewiring for all links. As the probability increases, an increasing number of short cuts (red lines) 
connect distant nodes in the network. This generates the small-world effect: all nodes in the network 
can be connected by passing along a small number of links between nodes, but neighbouring nodes 
are connected to one another, forming clustered cliques. (Adapted from Samay/Vespignani.) 
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average path length between any two nodes 
in the network — measured as the smallest 
number of edges needed to connect the nodes 
— scales as the logarithm of the total number 
of nodes. In other words, randomness is suffi- 
cient to explain the small-world phenomenon 
popularized as ‘six degrees of separation”: the 
idea that everyone in the world is connected to 
everyone else through a chain of, at most, six 
mutual acquaintances. 

However, random construction fell short 
of capturing the local cliquishness of nodes 
observed in real-world networks. Cliquishness 
is measured quantitatively by the clustering 
coefficient of a node, which is defined as the 
ratio of the number of links between a node's 
neighbours and the maximum number of such 
links. In real-world networks, node clustering 
is clearly exemplified by the axiom ‘the friends 
of my friends are my friends’: the probability 
of three people being friends with each other 
ina social network, for example, is generally 
much higher than would be predicted by a 
model network constructed using the simple, 
stochastic process. 

To overcome the dichotomy between 
randomness and cliquishness, Watts and 
Strogatz proposed a model whose starting 
point is a regular network that has a large 
clustering coefficient. Stochasticity is then 
introduced by allowing links to be rewired at 
random between nodes, with a fixed probabil- 
ity of rewiring (p) for all links. By tuning p, the 
model effectively interpolates between a reg- 
ular lattice (p> 0) and a completely random 
network (p> 1). 

Atvery small p values, the resulting network 
is a regular lattice and therefore has a high 
clustering coefficient. However, even at small 
p, short cuts appear between distant nodes in 
the lattice, dramatically reducing the average 
shortest path length (Fig. 1). Watts and Stro- 
gatz showed that, depending on the number of 
nodes’, it is possible to find networks that have 
a large clustering coefficient and short aver- 
age distances between nodes for a broad range 
of p values, thus reconciling the small-world 
phenomenon with network cliquishness. 

Watts and Strogatz’s model was initially 
regarded simply as the explanation for six 
degrees of separation. But possibly its most 
important impact was to pave the way for 
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studies of the effect of network structure on a 
wide range of dynamic phenomena. Another 
paper was also pivotal: in 1999, Barabasi and 
Albert proposed the ‘preferential-attachment’ 
network model’, which highlighted that the 
probability distribution describing the number 
of connections that form between nodes in 
real-world networks is often characterized 
by ‘heavy-tailed’ distributions, instead of the 
Poisson distribution predicted by random 
networks. The broad spectrum of emergent 
behaviour and phase transitions encapsulated 
in networks that have clustered connectedness 
(as in Watts and Strogatz’s model) and hetero- 
geneous connectedness (as in the preferential- 
attachment model) attracted the attention of 
scientists from many fields. 

A string of discoveries followed, highlighting 
how the complex structure of such networks 
underpins real-world systems, with implica- 
tions for network robustness, the spreading 
of epidemics, information flow and the syn- 
chronization of collective behaviour across 
networks”*, For example, the small-world con- 
nectivity pattern proved to be the key to under- 
standing the structure of the World Wide Web” 
and how anatomical and functional areas of the 
brain communicate with each other’®. Other 
structural properties of networks came under 
the microscope soon after’, such as modu- 
larity and the concept of structural motifs, all 
of which helped scientists to characterize and 
understand the architecture of living and arti- 
ficial systems, from subcellular networks to 
ecosystems and the Internet. 

The current generation of network research 
cross-fertilizes areas that benefit from unprec- 
edented computing power, big data sets and 
new computational modelling techniques, and 
thus provides a bridge between the dynamics 
of individual nodes and the emergent proper- 
ties of macroscopic networks. But the imme- 
diacy and the simplicity of the small-world and 
preferential-attachment models still under- 
pin our understanding of network topology. 
Indeed, the relevance of these models to differ- 
ent areas of science laid the foundation of the 
multidisciplinary field now known as network 
science. 

Integrating knowledge and methodologies 
from fields as disparate as the social sciences, 
physics, biology, computer science and applied 
mathematics was not easy. It took several years 
to find common ground, agree on definitions 
and reconcile and appreciate the different 
approaches that each field had adopted to 
study networks. This is still a work in progress, 
presenting all the difficulties and traps inher- 
ent in interdisciplinary work. However, in the 
past 20 years a vibrant network-science com- 
munity has emerged, with its own prestigious 
journals, research institutes and conferences 
attended by thousands of scientists. 

By the 20th anniversary of the paper, more 
than 18,000 papers have cited the model, which 
is now considered to be one of the benchmark 
network topologies. Watts and Strogatz closed 


their paper by saying: “We hope that our work 
will stimulate further studies of small-world 
networks.” Perhaps no statement has ever been 
more prophetic. = 
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A complex story of 
receptor signalling 


G-protein-coupled receptors activate different G-protein types to trigger 
divergent signalling pathways. Four structures of receptor-G- protein complexes 
shed light on this selectivity. SEE ARTICLES P.547, P.553 & P.559 & LETTER P.620 


MICHAEL J. CAPPER & DANIEL WACKER 


bout one-third of all drugs, including 

opioid painkillers, antihistamines and 

many antipsychotics, target members 
of a family of proteins called G-protein- 
coupled receptors (GPCRs)’. This reflects the 
fact that GPCRs are important in almost all 
aspects of human physiology, and suggests 
that many more of them will be promising 
drug targets for numerous diseases. GPCRs 
span the cell membrane and convert myriad 
extracellular signals, including neurotrans- 
mitter molecules, hormones, and even light, 
into a cellular response by activating cellular 
G proteins and other transducer proteins. 
Four papers” in this issue help to unravel the 
mystery of how GPCRs selectively activate a 
particular group of G proteins known as G;,,; 
and provide clues that might aid the design of 
improved GPCR-targeting drugs. 

Although more than 800 GPCRs are encoded 
in the human genome, they couple to only a 
small number of intracellular signal trans- 
ducers, including 16 Ga proteins’. The latter 
proteins assemble with Gf and Gy proteins to 
form heterotrimeric G proteins. The G-protein 
complex disassembles on activation by GPCRs, 
whereupon the various subunits activate differ- 
ent signalling pathways. For instance, stimu- 
latory Ga proteins (known as G,) increase 
cellular levels of cyclic AMP molecules, which 
regulate various cellular processes. Structures 
of G,-bound GPCRs have been reported”* that 
have begun to elucidate the general activa- 
tion mechanism of Ga proteins, and of G, in 


particular. But much less is known about how 
GPCRs selectively activate inhibitory Ga pro- 
teins, which include G,,, G,, G,; and G,, and are 
collectively known as G;,,. 

The four papers in this issue report structures 
of G,,,-bound GPCRs obtained using cryo- 
electron microscopy: Koehl et al.’ (page 547) 
report the structure of the 1-opioid receptor 
bound to G,,; Draper-Joyce et al.* (page 559) 
describe the adenosine A, receptor in complex 
with G,,; Garcia-Nafria et al.* (page 620) report 
the 5HT,, receptor bound to G,; and Kang 
et al.° (page 553) reveal the structure of the light 
receptor rhodopsin in complex with G;,. The 
G-protein activation cycle involves the bind- 
ing and release of nucleotides to and from the 
G proteins, and all of the reported structures 
capture the receptors bound to the nucleotide- 
free state of their respective G proteins. 

In some respects, the four structures are 
similar to those of the previously published 
GPCR-G, complexes”*, probably because 
G,- and G,,,-containing complexes have the 
same overall conformation at the stage of the 
G-protein activation cycle captured by the 
structures. Nevertheless, the G,,,-containing 
structures reveal striking differences at the 
receptor-G-protein interface when com- 
pared with the G,-containing structures. For 
example, there are no interactions between 
the receptors and the Gf subunits in the 
G,,,-containing structures. 

The four structures uncover several key 
interactions at the GPCR-G,,, interface medi- 
ated by the a5 helix — an a-helix structure 
in the carboxy terminus of Ga subunits. It is 
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Figure 1 | Structural differences in complexes of G-protein-coupled receptors (GPCRs) with 
G proteins. GPCRs are transmembrane receptors that activate cellular signalling pathways by binding 
to G proteins, which have three subunits: a, B and y. Stimulatory Ga proteins are known as G,, whereas 


inhibitory Ga proteins (G, and G, proteins) are collectively known as G 


. Many GPCRs selectively 


i/o 


bind to G, or G,,,, but the basis of this selectivity was unknown. a, This cartoon shows the positions of 
three a-helices in complexes of GPCRs with G,-containing G proteins, based on previously reported 
structures”*. TM6 and TM7 are transmembrane helices in the GPCR, whereas a5 is in the carboxy 


terminus region of G,. b, Four papers” ° now report the structures of GPCRs in complex with G 


i/o 


proteins. Compared with a, the a5 helices are rotated and moved slightly towards TM7, and away from 
TM6. The outward displacement of TM6 is smaller than that in a. The smaller displacement of TM6 


might block the binding of G, proteins, thus explaining how GPCRs bind selectively to G 


known that the binding of this helix to the 
receptor’s cytoplasmic site triggers confor- 
mational rearrangements in Ga that cause the 
release of a nucleotide (GDP) bound to Ga, 
initiating G-protein activation’. The position- 
ing of the G,,, a5 helices in the new structures 
is different from that of the analogous heli- 
ces in the GPCR-G, complexes. Specifically, 
the G;,, a5 helices are rotated and translated 
slightly towards transmembrane helix (TM) 7 
in the GPCR and away from TM6. Moreover, 
TM6 is displaced outwards from the receptor 
core by a smaller amount than occurs in the 
G,-bound GPCRs (Fig. 1). The authors of all 
four papers therefore suggest that the smaller 
displacement of TM6 might preclude binding 
of G, and help to explain how GPCRs can bind 
selectively to G,,, proteins. 

The difference in the positioning of the 
a5 helices seems to be due to the G, a5 helices 
containing bulkier amino-acid residues than 
those of the G,,, a5 helices. Moreover, Kang 
et al. analysed and compared the amino-acid 
sequences for TM6 in the G,- and G,,,-coupled 
receptors, and suggest that the different pat- 
terns of hydrophobic and hydrophilic residues 
observed in the two systems might affect the 
amount of displacement of TM6, and thus 
contribute to G,,, specificity. 

Comparison of the four GPCR-G,,, 
structures reveals considerable structural 
plasticity at the interface. This is not sur- 
prising, given that G;,, proteins are engaged 
by hundreds of GPCRs that have diverse 
structures and sequences. Draper-Joyce 
et al. thus suggest that G-protein specificity 
is not necessarily encoded by evolutionar- 
ily conserved interactions between specific 
amino-acid residues, but might be based on 
“pocket complementarity’, in which confor- 
mational rearrangements produce regions on 
the GPCR cytoplasmic site that are conducive 
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to the binding of specific G proteins. Further 
evidence for this comes from the fact that all 
the structures of the GPCR-G,,, complexes 
display markedly smaller GPCR-G protein 
interfaces than do the structures of GPCR-G, 
complexes. This is particularly pronounced 
for the 5-HT,, receptor-G, interface surface, 
which Garcia-Nafria et al. report has an area 
of 822 square angstroms; this compares with 
1,260 A’? and 1,135 A? for the interfaces in the 
G,-bound B,-adrenergic’ and adenosine Aj, 
receptors’, respectively. 

Finally, Koehl et al. report subtle, yet 
potentially crucial, differences in the confor- 
mations of G,, and G, that occur during the 
transition between 
the GDP-bound and 


“The G-protei 
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might also contribute 
to the G-protein specificity of GPCRs. 

This body of work provides a key step 
towards delineating the molecular mecha- 
nisms by which GPCR conformations drive 
the activation of one signalling pathway in 
preference to another. Many more such struc- 
tures are sure to follow, and will probably 
reveal structural hallmarks that drive GPCR 
coupling to other G proteins and signal trans- 
ducers, such as arrestin proteins. However, 
as the authors of all four papers point out, 
these studies provide only snapshots of the 
G-protein activation pathway, and are thus 
incomplete. The coupling specificity of GPCRs 


depends on several factors not addressed by 
the new structures, including the pre-coupling 
of G proteins” (a preliminary step in which 
GPCRs and G proteins associate with each 
other, before actually coupling), and the bind- 
ing of the GDP-bound form of G proteins". 
The lifetimes of distinct receptor conforma- 
tions can also determine the specificity of 
GPCRs for transducers!””, adding a kinetic 
dimension to GPCR signalling that needs to 
be considered. 

A comprehensive molecular model of GPCR 
specificity for G proteins and transducers would 
not only improve our understanding of how 
GPCRs elicit complicated signals involving 
multiple, occasionally intersecting, pathways, 
but also facilitate the design of better drugs 
that target GPCRs. In particular, it could allow 
the structure-based design of drugs that selec- 
tively activate or inhibit particular signalling 
pathways, thereby making them safer and more 
effective than currently available therapeutics. 

For example, the painkilling properties 
of opioid medications such as morphine are 
thought to arise from the activation ofa G, pro- 
tein by the p1-opioid receptor, whereas coupling 
of the receptor to arrestin probably causes the 
drugs addictive properties and the — often 
fatal — depression of respiratory functions. 
Much effort has thus been dedicated to design- 
ing opioid compounds that provide pain relief, 
but that reduce the risk of addiction or over- 
dose. A flurry of structures of isolated GPCRs 
has already greatly facilitated the discovery of 
compounds that bind to the receptors, and that 
are useful tools for laboratory studies“. But it is 
the structures of GPCR signalling complexes 
that will allow the rational design of pathway- 
selective drugs. After all, GPCR signalling is, 
literally, a complex story. = 
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Triggers of tree mortality under drought 


Brendan Choat!*, Timothy J. Brodribb?, Craig R. Brodersen?, Remko A. Duursma!, Rosana Lopez!* & Belinda E. Medlyn! 


Severe droughts have caused widespread tree mortality across many forest biomes with profound effects on the function 
of ecosystems and carbon balance. Climate change is expected to intensify regional-scale droughts, focusing attention 
on the physiological basis of drought-induced tree mortality. Recent work has shown that catastrophic failure of the 
plant hydraulic system is a principal mechanism involved in extensive crown death and tree mortality during drought, 
but the multi-dimensional response of trees to desiccation is complex. Here we focus on the current understanding of 
tree hydraulic performance under drought, the identification of physiological thresholds that precipitate mortality and 
the mechanisms of recovery after drought. Building on this, we discuss the potential application of hydraulic thresholds 


to process-based models that predict mortality. 


stocks and have a key role in hydrological and nutrient cycles”. 

They also provide a wide array of ecosystem services and are vital 
for maintenance of biodiversity. While forests continue to face pressure 
from expanding human populations, which drive changes in land use and 
deforestation, the threat posed by climate change is less easily quantified. 
Evidence from a range of sources suggests that rising atmospheric CO 
concentrations have benefited forests, with CO, fertilization enabling an 
increased leaf area index*, enhanced water-use efficiency’ and greater 
uptake of carbon globally°. However, extreme climate events, such as 
heat waves, droughts, fires and storms, have the potential to offset these 
benefits, causing widespread tree mortality and a net loss of CO} into the 
atmosphere. Although forests are vulnerable to a wide range of extreme 
climate events, drought and associated disturbances have the greatest 
effect globally®. Recent projections’ indicate that land surface warming 
may lead to longer and more intense droughts, which has focused con- 
cern on this area of research and the need for accurate predictions of 
the effects of drought on forest ecosystems. In this Review, we examine 
the physiological response of trees to drought, focusing on new insights 
provided by rapid advances in our understanding of the hydraulic 
function of plants. 

Land plants require an efficient long-distance transport pathway to 
lift water from the soil to the leaves at a rate that satisfies transpira- 
tion®. In trees, the xylem tissue (wood) supplies water for all aspects of 
plant function, including photosynthesis, growth and reproduction. 
Damage to this hydraulic supply network as a consequence of severe 
water stress has been identified as a key mechanism that is involved 
in tree mortality during drought®"!!. Recent experimental work has 
quantitatively linked hydraulic failure thresholds to plant mortality!*", 
and field studies have demonstrated that hydraulic failure is a primary 
pathway for extensive canopy death or plant mortality during natural 
drought events'4-!”, 

A number of other co-contributing factors may also have a role in the 
death of trees during natural droughts’*. In the absence of catastrophic 
hydraulic failure, partial disruption of water transport and the regula- 
tion of water loss from plants during drought may lead to an increased 
likelihood of mortality through the depletion of carbohydrate reserves 
used in respiration and increased vulnerability to pests and pathogens"? 
Therefore, even in cases of co-morbidity, plant hydraulic traits occupy 
a central role in determining survival during drought and the effects of 
drought on carbon dynamics. 


HF orests account for approximately 45% of global terrestrial carbon 


Here, we cover recent progress in our understanding of plant hydraulic 
response to drought and the physiological mechanisms that govern recovery 
of hydraulic function after drought. Although recent advances have 
crystallized our understanding of plant hydraulic function and the 
consequences of vascular impairment caused by drought stress, many 
challenges remain. We evaluate recent attempts to integrate the hydraulic 
traits of plants into process-based models of tree mortality with an 
emphasis on major knowledge gaps. 


Drought and forest mortality 

The effect of future droughts will almost certainly be worsened by 
increases in air temperature associated with global warming; when 
natural droughts occur they will set in more quickly and be of greater 
intensity’. Higher temperatures will usually result in greater evapo- 
transpiration (the sum of evaporation and plant transpiration), thus 
drying soil and plants more quickly than would be the case at lower 
temperatures’. Droughts of this nature, termed ‘global change-type 
droughts; have had severe effects on exposed ecosystems including 
mass tree mortality*?!. 

Globally, drought is the most widespread stress factor that affects 
forest carbon balance® with the potential to cause pronounced 
depressions in gross primary productivity at regional and conti- 
nental scales”. The most notable effects of drought are manifested 
in regional-scale forest mortality events, which can kill millions of 
trees within short timescales. Recent high-profile examples include 
extreme droughts in Texas and California, which are estimated to 
have killed 300 million and 102 million trees, respectively”**®, Mass 
tree mortality due to drought is not restricted to arid regions, having 
been documented across many forest biomes including cool tem- 
perate and tropical forests!*"!®?78. In tropical northern Australia, 
the sudden die-off of more than 7,000 ha of mangrove forest in 2015 
was attributed to drought and extreme temperatures”*. Although 
such concentrated mortality events are yet to be observed in many 
of the world’s most productive tropical ecosystems, drought events 
in tropical rainforests (for example, the 2005 Amazon drought) 
have resulted in marked increases in stem mortality and loss of 
aboveground biomass”’. Mortality is often skewed towards young 
trees but recent evidence suggests that large, old trees are also 
vulnerable*”?!. Loss of large trees is particularly concerning because 
they have a critical ecological role and have the largest biomass and 
storage of carbon. 
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Against this backdrop, it is essential to improve the accuracy with 
which we can predict the response of trees to drought to understand 
the resilience of forests under future climate regimes. At present, mor- 
tality is not well-represented in vegetation models, owing mostly to 
gaps in our understanding of physiological mechanisms and a lack of 
appropriate thresholds with which to parameterize these models. We 
therefore turn our attention to how these problems may be resolved. 


Drought and hydraulic failure in trees 

As with all vascular plants, trees prevent desiccation injury by using an 
intricate plumbing system of hollow dead cells (vessels or tracheids) 
to transport water from the soil to the leaves. Xylem transport relies 
on an elegant mechanism whereby liquid water is held under tension, 
enabling trees to lift vast volumes of water to the canopy at little ener- 
getic cost??,. However, liquid water under tension exists in a metastable 
state, similar to that of a superheated liquid*?. In this state, water is 
prone to cavitation, a sudden phase change from liquid water to gas 
that creates a bubble (embolism). These gas emboli block water flow 
through xylem conduits and reduce the delivery of water to the can- 
opy and regenerative tissues (that is, apical and cambial meristems)®. 
Drought leads to higher xylem tensions and an increased probability 
that emboli will spread throughout the xylem network causing systemic 
vascular dysfunction'?34, 


Phases of drought stress and the response of plants 
During drought, reduced precipitation leads to declines in soil mois- 
ture, which are often accompanied by higher temperatures and 
increased evaporative demand from the atmosphere. These factors 
combine to induce water stress in plants, which is manifested as 
increased tension in the xylem sap. Water stress is measurable in plants 
as xylem water potential (W,), a variable that is primarily determined by 
pressure in the xylem fluid and becomes increasingly negative during 
drought*?*». As plants desiccate, the loss of cell turgor causes stomatal 
pores on the leaf surface to close, markedly slowing plant dehydration 
and the rate of decrease in Y. Most recent studies indicate that sto- 
mata in trees close before reaching the threshold Y% at which significant 
cavitation is initiated, despite the negative consequences of stomatal 
closure*°*8 (Fig. 1). On short time scales, these consequences include 
a rapid cessation of photosynthetic CO, assimilation, loss of canopy 
evaporative cooling through transpiration and greater probability of 
photodamage***®. Over longer time scales, low photosynthetic rates 
associated with drought-induced stomatal closure can lead to depletion 
of non-structural carbohydrate pools, which interferes with transloca- 
tion of sugars through the phloem!!*!” and the production of chem- 
ical defence compounds needed to prevent herbivory and disease!*’. 
The fact that stomatal closure generally occurs before the initiation of 
cavitation despite these costs suggests that avoidance of xylem cavita- 
tion is of paramount importance for the long-term survival of trees. 
After stomatal closure, Y% continues to slowly decrease, becoming 
more negative as water is lost through cuticular conductance, stomatal 
leakiness** and other tissues such as bark“. At the same time, hydrau- 
lic conductance may decrease throughout the hydraulic pathway of 
the plant through a number of biophysical and physiological mecha- 
nisms, including reversible collapse of leaf veins*®, regulation of aqua- 
porins in cell membranes*”*8 and the formation of cortical lacuna in 
fine roots”. Rates of water loss during this phase are typically in the 
order of 100-1,000-fold less than when the stomata are fully open’ 
and decreases in Y% are further buffered by the release of internally 
stored water™. If drought persists, Y% will ultimately reach a critical 
threshold at which emboli begin to propagate through the xylem®*". 
This process occurs throughout the hydraulic pathway including roots, 
stems and leaves***849>3 (Fig. 2). Because emboli greatly reduce water 
delivery to the canopy, this hydraulic dysfunction can cause patchy 
branch death and pronounced reductions in canopy leaf area**. During 
intense droughts, emboli spread throughout the water transport net- 
work, causing systemic failure of the vascular system*. In the face of 
continuing drought and high evaporative demand, systemic vascular 
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dysfunction may cause rapid mortality of the whole plant through 
desiccation'*'>!°and death of the meristematic tissue in the cambium 
and apical meristems. 


Hydraulic traits of trees and adaptations to drought 

The risk of hydraulic failure is an unavoidable consequence of trans- 
porting water under tension, and thus forms a fundamental axis of 
selection in the evolution of trees***”. Strategies to preserve the integrity 
of the plant vascular system in trees are diverse, but all revolve around 
a simple framework defined by two constraints: (1) the physical limits 
of the vascular system; and (2) the capacity to maintain plant water 
potential within these functional limits. These two attributes dictate 
how rapidly plant tissues will dehydrate during a drought and the spe- 
cific thresholds at which water stress will translate into hydraulic failure 
and mortality>”>*. 

Although it is possible to characterize a general sequence of events 
that describe the response of vascular plants to drought, the traits 
that define this response vary across species and environments**'. 
Recent studies have illustrated the enormous variation in vulnerability 
to xylem cavitation across tree species, with changes in xylem 
vulnerability correlated to mean annual precipitation and aridity of 
their growth environment™. Species are typically compared by the 
W, value at which a 50% loss of hydraulic conductance occurs (Wp), 
although other reference points may have more physiological impor- 
tance, for example, Weg (Fig. 1). Differences in vulnerability are driven 
by the anatomical features of the xylem, including conduit dimensions, 
network organization and the porosity of primary cell walls (pit mem- 
branes) that limit the spread of gas between conduits®*. These features 
control the critical Y, at which gas will penetrate pit membranes, causing 
cavitation in adjacent conduits and the spread of embolism through 
the xylem’. However, vulnerability to cavitation does not determine 
drought tolerance in itself. The probability of reaching the critical 
threshold and the length of time it takes for this to occur are deter- 
mined by the interaction of a number of associated physiological and 
morphological traits (Fig. 3). 

The multi-dimensional nature of such trait interactions has enabled 
vascular plants to inhabit nearly every terrestrial habitat on Earth and 
enabled a huge number of possible morphological and physiological 
solutions to tolerating drought. For instance, variation in the vulnera- 
bility of plants is often high within communities, particularly in drier 
habitats, indicating that vulnerability and aridity are decoupled in some 
cases™. This decoupling results from water-stress avoidance strategies 
that are used by some species, such as deep root systems or drought 
deciduousness, that allow them to maintain a higher W, during drier 
periods. Although this complexity makes the development of models 
challenging, a suite of well-studied traits that are mechanistically linked 
to drought tolerance have now emerged (Supplementary Table 1) and 
represents a promising direction for future research. Recent analyses 
have suggested that these traits often vary in a coordinated fashion 
that allows the benefits of photosynthetic carbon gain to be balanced 
against the risks of a decrease in YW, and the occurrence of hydraulic 
failure**. Thus, much of the complexity of trait interactions may col- 
lapse onto a single axis that defines a spectrum of drought tolerance 
strategies®. 

Ultimately, we are interested in predicting when a plant will die as 
a result of drought stress. Vulnerability to cavitation has emerged as a 
key physiological trait that is associated with mortality, and hydraulic 
failure represents a critical point in the drought response pathway. 
Species-specific tree hydraulic limitations provide a powerful mecha- 
nistic explanation for the observation that drought mortality is occur- 
ring across forest biomes, independent of the mean rainfall at any site. 
A recent data synthesis demonstrated that the majority of plant species 
converge to narrow hydraulic safety margins, that is, the buffer between 
minimum water potential experienced by the plant (Yin) and the 
threshold Y, for rapid loss of vascular function caused by cavitation“. 
Because Win integrates many important aspects of plant structure 
(for example, rooting depth) and physiology (for example, stomatal 
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Fig. 1 | Phases of drought response in plants. a, Time series of transverse 
slices through the xylem tissue obtained by X-ray microtomograpy show 
the spread of gas emboli through the xylem with increasing drought stress 
(left to right). In each slice, water-filled vessels are seen as bright circles 
whereas vessels that contain gas emboli are black. During severe drought, 
almost all vessels become gas-filled, which leads to whole-plant mortality 
(right). b, During the first phase, stomata close to limit water loss and 
delay the decrease in xylem water potential (blue line). After stomata close, 
water continues to be lost at a much lower rate via cuticular conductance. 


behaviour) in relation to the environment, the narrow safety margins 
that are found across forest types offer an important insight into plant 
ecology, one that suggests that the hydraulic strategies of plants are 
finely tuned to their environment, allowing for maximum carbon gain 
but exposing plants to the risk of hydraulic failure during drought. It 
also suggests a generally ‘risky’ strategy in which plants have limited 
physiological potential to respond to rapid changes in the environment. 
This exacerbates the threat posed by increased occurrences of extreme 
drought under climate change. Indeed, drought mortality events across 
forests from a broad geographical and climatic range have been linked 
quantitatively to hydraulic traits and xylem cavitation®’. Examples 
come from tropical rainforest®’, temperate forests'*'®, chaparral'*!” 
and desert woodlands''. Many of these studies show differential species 
mortality within each forest type, allowing us insights into the potential 
winners and losers under future drought regimes. 


Plasticity and genetic variation in hydraulic traits 

Although much is known about the variability in hydraulic traits among 
plant species, far fewer data are available to quantify within-species 
variation. The capacity of trees to alter phenotype (that is, phenotypic 
plasticity) and the amount of genetic diversity within a population are 
key variables for the ability of species to cope with rapid climate change. 


At a critical threshold, cavitation increases rapidly and gas emboli spread 
throughout the xylem (red line). Increasing levels of embolism are shown 
as the proportional loss of xylem hydraulic conductance. ‘Vulnerability 
curve’ analysis translates the physics of cavitation to a quantification of 
species susceptibility to cavitation during exposure to water stress. These 
mortality thresholds have been found to correspond to between 50% (Wo) 
and 88% (Wg) loss of hydraulic function in conifers and angiosperms, 
respectively. c, A general scheme for the magnitude and timing of response 
processes with increasing drought stress. 


It is unlikely that trees will be capable of adapting to sudden increases 
in the aridity of their environment through evolutionary mechanisms, 
because of their long generation cycle and inability to migrate away 
from stress. On the other hand, adaptive plasticity of hydraulic traits 
may enable the acclimatization of entire populations within the neces- 
sary timescales. Quantifying the extent of plasticity in hydraulic traits 
is therefore an essential component for the prediction of the tolerance 
ranges and resilience to drought of different species. However, com- 
prehensive datasets that examine the genetic variation and phenotypic 
plasticity of vulnerability to cavitation have only recently become avail- 
able and are limited to a few species. A study of 513 genotypes of the 
widespread pine species Pinus pinaster showed low genetic variation 
of Yo between climatically contrasting populations and very limited 
phenotypic plasticity. These results suggest that Yo may be a canalized 
trait in pines, with little capacity to enable short-term acclimatization 
and adaptive plasticity. Angiosperm species have a higher potential for 
phenotypic plasticity and adaptive variation between populations”, 
although the observed shifts in vulnerability are often small in magni- 
tude relative to changes in WY, that are expected to occur during severe 
drought. 

Long-term manipulative experiments suggest that structural 
acclimatization, that is, changes in the allocation pattern between 
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Fig. 2 | Non-invasive imaging techniques have provided new insights 
into embolism formation and spread in the xylem. a, Mapping the spread 
of embolism in leaf vein networks during dehydration with transmitted 
light. Left, transmitted light images highlighting the vein network. Middle, 
image subtraction reveals embolism propagating from the midrib into 

the secondary and tertiary venation. Right, a colour map of all cavitation 
events recorded during desiccation. b, Three-dimensional rendering from 
a X-ray micro-computed tomograph of a pine stem. Embolized tracheids 
can be seen clearly as a black void space surrounded by water-filled (grey) 
tracheids. c, Part of a root system rendered from a micro-computed 
tomograph showing embolized xylem vessels (red) in the main root axis 
and lateral roots during dehydration. Right, the root tissue has been made 
transparent to illustrate the pathway of embolized vessels. Images in a were 
reproduced with permission from Brodribb et al.>*. 


water-absorbing, -conducting and -transpiring tissues, is almost cer- 
tainly the dominant process by which plants adjust their hydraulic sys- 
tems in response to drought”’. Reductions in the leaf to sapwood area 
ratio and shoot to root ratio result in a greater capacity to supply water 
to the leaves and limit the drop in Yin, consistent with homeostasis of 
water transport, however these changes come at the cost of reductions 
in productivity’”. Reductions in the leaf to sapwood area ratio result in 
the maintenance ofa higher YW and a greater capacity to supply water 
to the leaves’*. These results are consistent with studies of intraspecific 
variation in hydraulic architecture across aridity gradients, which show 
changes in morphology and allocation patterns; however, little evidence 
of adaptive variation in vulnerability to cavitation has been found in 
these studies, even in species with a wide climate envelope’*”°. Further 
studies are clearly required to determine whether these patterns can be 
generalized, particularly in angiosperms, and what role the plasticity of 
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hydraulic traits may have in the capacity of plants to survive increased 
aridity. 


Predicting mortality from hydraulic thresholds 
Predictions of drought-induced forest mortality require a detailed 
understanding of the physiological underpinnings of tree death. 
Accordingly, this topic has received much attention in recent years and 
substantial progress has been made in our understanding of the mech- 
anisms of tree mortality!!”. It is clear that drought-associated forest 
mortality is complex and a number of interdependent mechanisms 
have important roles in this process. These mechanisms include failure 
of water transport in the xylem, depletion of carbohydrate reserves 
over prolonged drought*!’ and increased vulnerability to pests and 
pathogens’®”*. All mechanisms of drought-associated mortality revolve 
around the effects of stomatal closure and increasing xylem tension 
during water shortage. Hydraulic failure is the most fully elaborated 
mechanism and currently holds the most promise for predictive 
models. It is a relatively well-understood biophysical process that is 
amenable to modelling’’, with failure thresholds that can be readily 
established for a given species or population®””°, Accuracy and con- 
fidence in the vulnerability thresholds that are chosen to represent 
different species in predictive models are absolutely critical. Recent 
technical and theoretical advances in the science of plant hydraulics 
have provided new certainty in the quantitative nature of hydraulic 
failure**°>’8. We thus focus on hydraulic traits as a means to under- 
stand and predict patterns of tree mortality in response to drought 
while emphasizing that other thresholds can be incorporated as our 
understanding of them improves. Indeed, incorporation of hydrau- 
lics thresholds, such as turgor loss and stomatal closure, should assist 
greatly in predicting carbon dynamics under drought. 


Measuring hydraulic thresholds to mortality 

As noted above, tree species exhibit a large range in xylem vulnerabil- 
ity*', Clear links between xylem cavitation and tree death have been 
established in pot studies'®!*'? and natural systems'>”*°, suggesting 
that xylem vulnerability should undergo strong selection in species 
exposed to episodic water stress. Evidence of selection, or ecological 
sorting, can be seen in the distribution of species with regard to strong 
correlations between aridity and xylem vulnerability'!°°°™. The excit- 
ing implication of this work is that the vulnerability of xylem tissue to 
cavitation provides a measurable index of the capacity of a species to 
tolerate water stress during drought’’. 

Our ability to predict the level of water stress at which a plant will die 
based on functional traits has advanced considerably in recent years. 
The concept of a ‘lethal water potential’ for a given plant species or 
population has existed for some time, but only recently has hydraulic 
vulnerability been quantitatively linked to mortality’. In conifers, the 
Wsp of the stem xylem is strongly related to their ‘minimum recoverable 
water potential; essentially a physiological point of no return'**!. By 
contrast, the lethal water potential of angiosperm species is correlated 
with more complete hydraulic dysfunction, representing 80-100% loss 
of xylem hydraulic conductance (Y%)'%!°. The disparity in mortality 
thresholds between conifers and angiosperms may be related to the 
fundamental difference in xylem structure between these two groups 
and goes some way to explaining the generally larger hydraulic safety 
margins and more conservative stomatal behaviour exhibited by conifer 
species®**. Other hydraulic thresholds for mortality have been pro- 
posed, including a sustained loss of hydraulic conductance greater than 
60%%83, We note that these studies were based on modelled thresholds 
of whole-plant conductance in two conifer species and compare well 
to the Wo threshold that has been established for conifers experimen- 
tally. A recent data synthesis found that all studies reported a 60% or 
greater loss in hydraulic conductance at death with a mean loss of 83%. 
Therefore, although there are some discrepancies between proposed 
thresholds, it is clear that high levels of xylem embolism are linked with 
mortality. Modern techniques for the measurement of water potential 
and non-invasive visualization of embolism are providing more 
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Fig. 3 | Tree hydraulic traits associated with drought-induced mortality. 
Trees use a variety of interdependent and coordinated morphological, 
anatomical and physiological traits to mitigate water loss and the 
development of increasingly negative xylem sap pressures during 

drought. This includes tissue-specific traits that function in the unique 
microenvironment of roots, stems and leaves, as well as traits that are 
common among most tissue types in trees. Many structure-function 


34,55 


accurate measures of hydraulic failure thresholds**”’, which can then 


be used to parameterize models of tree mortality. 


Trait-based models of tree mortality 

Although the vulnerability of xylem to cavitation defines a threshold 
in water stress beyond which tree mortality will occur, the key issue 
for predicting mortality is the ability to translate meteorological data 
(for example, precipitation or evaporative demand) into plant water 
content or xylem tension®®. This calculation presents a number of chal- 
lenges: it requires knowledge of the volume of water that is available to a 
plant in the soil and internal reservoirs, as well as the rate of water loss 
through transpiration. Calculating the amount of water in the soil that 
is available to the plant is made difficult by a paucity of data relating 
to rooting depth of trees and the architecture of the roots. Simulating 
the rate of water loss is complicated by the active regulation of transpi- 
ration by stomata, differences in cuticular transpiration after stomatal 
closure and the degree of leaf shedding during drought. To date, most 
attempts to model or predict mortality use empirical relationships 
between observed mortality and climate extremes***°. Such empirical 
relationships, although they provide insights into the current drivers 
of mortality, may not function well in the future if plant sensitivities 
change over time or if novel climate conditions occur. For example, 
rising CO, concentrations may alleviate drought stress, whereas rising 
temperatures may exacerbate it®°. Empirical relationships may also fail 
in regions where long-term shifts towards a novel, drier climate are 
occurring. Process-based models are thus highly desirable*”. 

Recent progress in the understanding of the hydraulic mechanisms 
that lead to mortality, and quantification of the key plant traits that are 
involved, has led to the incorporation of plant hydraulics in a range of 
process-based vegetation models****”. The key elements of such mod- 
els are a description of the soil-to-leaf hydraulic pathway, incorporating 
soil, root, xylem and stomatal conductances, whole-plant capacitance 
and the vulnerability of the xylem to cavitation. Important plant traits 
that are required for parameterization include the response of stomata 
to decreasing water potential, the point at which leaf turgor is lost, 


REVIEW 


Morphological traits (shoot) 

¢ Stomatal anatomy 

e Leaf vein density 

¢ Total leaf area 

¢ Leaf shedding/drought deciduous 
¢ Leaf to sapwood area ratio 


Xylem anatomical traits 


¢ Xylem conduit size, number and 
connectivity 


¢ Pit membrane thickness/porosity 
© Wood density 


Morphological traits (root) 
© Root to shoot ratio 

¢ Rooting depth 

Fine root loss 


relationships exist between traits, for example, variation in xylem 
anatomical traits (pit membrane porosity, conduit size and connectivity) 
determine species and population-level vulnerability to cavitation. 

Note that this figure does not represent an exhaustive list of hydraulic 
traits relevant to the response of trees to drought and drought-induced 
mortality. 


saturated xylem hydraulic conductance and water potential thresholds 
of vulnerability to cavitation. Recent compilation efforts have made 
data for these traits available for a wide range of species”! and enabled 
hydraulic traits to be related to other aspects of the plant economic 
spectrum®®®. Incorporating trait variation in drought sensitivity 
among species or genotypes, and relating this variation to plant water- 
use strategies and other plant properties, promises to be an effective 
way forwards. 

Nonetheless, critical gaps remain in our ability to describe the 
hydraulic pathway and its eventual failure. Here we draw attention to 
several gaps that hinder model development and parameterization and 
that have received comparatively little attention: (1) the dynamics of 
canopy leaf area during drought; (2) the dependence of plant water 
status on soil water potential; and (3) the process of plant desiccation 
in very dry soil, when root water uptake is no longer possible. 

First, leaf shedding occurs in many ecosystems during drought”, 
and can mitigate water stress to the remaining foliage”’, slowing the 
rate of desiccation (the ‘hydraulic fuse’ hypothesis**). However, drought 
deciduousness is, as yet, poorly captured in models”. Recently, it has 
been demonstrated that representing vegetation as a set of compet- 
ing plant types with varying degrees of drought deciduousness leads 
to a marked improvement in modelled leaf area dynamics in Central 
America; this hydraulics-based approach holds considerable promise 
for model improvement®®. 

Second, a key component of the hydraulic pathway is the relationship 
between plant water status (represented by pre-dawn plant water poten- 
tial, Y%4) and soil water availability (represented by soil water potential, 
Woi1). Simple models that treat soil water as a single bucket generally 
fail to capture this relationship; it appears that models need to incor- 
porate vertical gradients in soil moisture potential, the distribution of 
roots and changing soil-root resistance with soil drying®*°°, However, 
it is also commonly observed that co-occurring species can have dif- 
ferent Yq when W%oi is the same”, and this difference cannot always 
be explained by rooting distributions. The %q can be lower than Weoi1 
if overnight equilibration is insufficient®’, or considerable amounts of 
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night-time transpiration occur®’. On the other hand, root shrinkage or 
cortical deformation in dry soil can create an air gap around roots*?”*, 
preventing equilibration and leading to a %, that is less negative than 
the Y,,i. Foliar water uptake can also lead to shoot rehydration dise- 
quilibria with W,,i and these effects can be important for arid environ- 
ments”. There has been little focus to date on the ability of models to 
replicate measurements of Yq. 

Third, it is clear that processes that occur after stomatal closure are 
important, however these processes have been given less consideration 
in models**!™. Stomata generally close well before the thresholds for 
pronounced cavitation are reached*”>*!°!. Further increases in cavi- 
tation will occur only if models also represent plant water loss when 
stomata are closed. In very dry soil, the uptake of water by roots is no 
longer possible and plants have to rely on their internal water storage 
for survival'®°8, Plant water storage (an absolute amount of water) is 
distinct from the capacitance (the slope of water content versus poten- 
tial). Whereas capacitance is often incorporated into models because 
it determines the dynamics of the water potential in leaves'©’, plant 
water storage is not. The depletion rate of the plant water store when 
stomata are (nearly) closed is governed by the plant leaf area (discussed 
above), cuticular conductance“ and, probably less importantly, water 
loss through the bark’. Comparatively few measurements are available 
on cuticular conductance and plant water storage*’, hindering model 
parameterization for this final phase towards drought mortality. At 
this point, very few process-based models that have aimed to simulate 
drought mortality incorporate both plant water storage and cuticular 
conductance. A notable exception is provided by the recently developed 
SurEau model”. 

In summary, trait-based models that incorporate our best under- 
standing of hydraulic processes hold promise for predicting plant mor- 
tality in response to drought. We have highlighted three processes that 
deserve further attention, both in terms of model development and 
compilation of necessary data for parameterization. There are likely 
to be other problems as well, perhaps even larger ones—it is worth 
noting that no purely process-based model has yet been successful in 
predicting tree mortality®®. Model testing must thus continue before 
we can defensibly use these models for forecasting. We recommend a 
close coordination between experimentalists and modellers to improve 
models and their evidence base. 


Recovery of hydraulic capacity 
Although much attention has been devoted to determining the physi- 
ological basis of tree mortality during drought, it is equally important 
to understand the processes of recovery in trees that survive drought. 
What are the effects of drought on the hydraulic function of plants 
and how quickly can plants recover to the pre-drought levels of phys- 
iological performance? Predicting the resilience and recovery of for- 
ests is complicated by the predisposition of trees that enter a drought 
event, which includes the cumulative effects of previous water deficit, 
pest outbreaks and forest demographics, as well as potential delays 
between stress events that influence survival, mortality and recovery 
processes'®°8:105:04, Recovery of trees after drought is therefore com- 
plex, dynamic and determined by at least (1) the degree of damage to 
the apical and cambial meristematic tissues; (2) the functional status of 
the remaining hydraulic pathway; (3) the overall health of trees (that is, 
the remaining foliage and roots); and (4) the water, non-structural car- 
bohydrates and nutrients that are available during the recovery phase. 
Mild drought stress does not typically result in high levels of cavi- 
tation, although it may result in the transient and easily reversible loss 
of hydraulic capacity that is associated with mechanisms such as con- 
duit collapse in leaf veins and aquaporin regulation of cell membrane 
permeability**“®. In cases in which drought stress has caused consid- 
erable hydraulic dysfunction without mortality, hydraulic recovery 
could occur by two mechanisms: new wood formation or by refilling 
embolized conduits. Regrowth of the xylem appears to be the primary 
means by which trees recover hydraulic capacity after drought’*’. 
This straightforward process involves the addition of new conduits 
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(vessels or tracheids) to outer regions of the xylem through the activity 
of the vascular cambium. This replaces the hydraulic conductance that 
is lost by embolized conduits, which may then become permanently 
occluded by gums or tyloses!®°. In cases in which drought has caused 
considerable death of aboveground biomass, recovery may be facili- 
tated by resprouting of stems from epicormic buds or lignotubers’”. 
The prevalence of resprouting is highly variable among tropical and 
temperate forest species and is ultimately dependent on the protection 
and survival of the meristematic tissue to produce new shoots!*!0”, 

An alternative mechanism for recovery of hydraulic capacity, which 
would be far more rapid than regrowth, is refilling of embolized xylem 
conduits. Much research has focused on the potential active mech- 
anisms by which plants could refill embolized xylem conduits after 
drought'®*""””. Springtime refilling following freeze-thaw cycles that 
produce embolism over winter is well-documented and apparently 
dependent on the positive pressure that is generated in the roots or 
stems of deciduous angiosperms’'!'!?. A number of studies have 
shown rapid refilling of embolized vessels after mild drought in her- 
baceous species''*'!* and previous studies have suggested that daily 
cycles of cavitation and refilling are common in some tree species!!®!!”, 
However, recent work has cast doubt on this phenomenon and convinc- 
ing evidence for short-term refilling after drought in large trees is gen- 
erally lacking’. It appears unlikely that trees can establish the positive 
pressure that is required to remove emboli in transpiring tissues many 
metres above the soil surface, making refilling under sustained tension 
within the current theoretical framework thermodynamically unten- 
able’. Although it is thought that some woody species can remove 
drought-induced emboli within hours or days after a soil-saturating 
event in combination with non-transpiring conditions, this has only 
been documented unambiguously in grapevines, which are well-known 
for their capacity to produce considerable root pressure!!®!°. Other 
studies using non-invasive imaging have failed to provide evidence of 
refilling in woody species after drought*!'”%'"". Thus, we believe it is 
unlikely that refilling in trees is a common mechanism for rapid recov- 
ery of hydraulic capacity after drought, although further experimental 
studies are required to confirm this. 

The rate of hydraulic recovery after exposure to drought is largely 
is dictated by the extreme of negative water potential that is reached, 
and the amount of time that is spent at this extreme. Drought recovery 
after rainfall occurs on very short time scales if high levels of cavi- 
tation have not taken place'”* with the rapid opening of stomata to 
fix new carbon from the atmosphere as plants re-hydrate. However, 
in cases in which cavitation thresholds are breached, the recovery of 
photosynthesis is much slower!”” and proceeds in coordination with 
the onset of the restoration of hydraulic condutance from the soil to the 
canopy'”*!. Growth of new xylem to replace compromised tissues 
requires long-distance signalling from the roots and leaves'”’, and 
xylogenesis is affected by the post-drought environment because of 
sensitivity to temperature, plant growth regulators, carbohydrate pools 
and water availability'**'”°. Following drought, trees invest substantial 
carbon resources into rebalancing the root to shoot ratio’, increasing 
fine root biomass, and exploring deeper regions of the soil profile to 
recover plant water status'””. Because meristematic tissue is dependent 
on adequate water availability and phloem transport to establish the 
turgor that is necessary for xylem cell expansion!”8, xylem development 
during drought conditions is often markedly reduced. Thus, functional 
water transport and signalling pathways must be in place to coordinate 
these events and initiate recovery. 


The way forwards 

Hydraulic physiology is central to our understanding of how trees 
respond to drought, and the pathways leading to drought-induced mor- 
tality. Because the hydraulic system is fundamentally linked to carbon 
balance through stomatal regulation, a mechanistic understanding of 
plant hydraulic function should greatly improve modelling of vegeta- 
tion dynamics under water-limiting conditions. With recent progress 
in methodology, we now stand at an exciting threshold in the field. 
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There is potential for great leaps in our understanding of plant hydrau- 
lic function and our ability to quantitatively link these physiological 
mechanisms to forest ecology. Despite this, many important challenges 
remain. Below, we outline research priorities and key knowledge gaps 
that constrain our ability to predict drought-induced mortality and 
recovery after drought. 

In terms of manipulative drought experiments, it is clear that stud- 
ies incorporating long-term droughts on large trees are essential. 
Thresholds for hydraulic failure have typically been determined on 
small plants with intense drought treatments and important differences 
may exist in large trees exposed to droughts of greater duration. This is 
particularly applicable to determining the time a tree must spend at or 
below a particular threshold in order for mortality to occur. Long-term 
experiments will also provide an improved understanding of how tree 
water relations and carbon balance interact to cause mortality during 
prolonged droughts. 

Better monitoring of plant water potential in communities under 
drought is also essential in this context. The importance of Yin as a 
parameter cannot be understated, since the Y% reached by plants is 
what largely determines the probability that hydraulic failure occurs. 
It integrates many aspects of plant structure and physiology, and their 
interaction with climatic and edaphic variables’®. Other thresholds, 
such as the point at which leaf turgor is lost and stomatal closure occurs, 
can be predicted for a species based on continuous WY, data, although 
seasonal acclimatization in these parameters needs to be taken into 
account. Unfortunately, datasets of Yin are typically patchy because 
of the laborious onsite measurement techniques that are involved. 
Continuous remote monitoring of WY, is now becoming possible because 
of the development of a new generation of wireless sensors that can be 
deployed at remote sites'””!*°. This will enable a much higher resolution 
of Yuin, the true maximum level of stress that trees are exposed to 
during natural droughts, and how long they spend at a given VY. 

Improved methodology for the measurement of vulnerability to 
cavitation is also being developed to enable in situ measurement of 
hydraulic thresholds*”** and more rapid phenotyping of thresholds 
across species and populations. Non-invasive imaging techniques, 
such as X-ray micro-computed tomography and magnetic resonance 
imaging, are providing new insights into plant hydraulic function and 
response to drought. Although these techniques are often costly or 
difficult to access, they provide unparalleled spatial and temporal res- 
olution for observations in living, intact plants. Further work in this 
area will be vital to unravel some persistent mysteries of plant vascular 
transport including the phenomenon of embolism repair after drought 
and the degree to which it is active in woody plants. Improvements in 
techniques are also essential for the quantification of plasticity within 
species and their capacity to acclimatize or adapt to drier conditions. 
A recently developed optical technique shows great promise to facili- 
tate cost-effective high-throughput measurements of vulnerability to 
cavitation in leaves, stems and roots**. Methods that use centrifugal 
force to generate vulnerability curves also enable rapid phenotyping® 
provided the appropriate methodological precautions are observed)". 

The use of whole-tree techniques, for example, sap flow, is neces- 
sary to provide datasets that interface with databases of tissue-level 
traits. Measuring sap flow and consolidating existing sap flow datasets 
worldwide (for example, the Sapfluxnet project)!” represent important 
steps in linking tissue-level hydraulic traits to water fluxes and testing 
large-scale model predictions of the effect of water stress on plants 
and ecosystems. At a broader scale, remote-sensing tools will be essen- 
tial to monitor tree mortality and the dynamics of drought recovery. 
Although there are issues to be solved with detection of mortality using 
satellite-based sensors!**, aircraft-based sensors are delivering better 
resolution of water stress and tree death at stand and regional scales”. 
Although the goal of attaining accurate model predictions of tree 
mortality due to drought remains elusive, resolution of the challenges 
outlined above represents a clear path forward. Success in this area will 
require direct collaboration between experimentalists and modellers, as 
the effective parameterization of process-based models depends on the 


REVIEW 


acquisition and sharing of often hard-won data (for example, rooting 
depth, cuticular conductance and cavitation resistance). Finally, we 
emphasize that hydraulic failure is not the only pathway to mortality 
associated with drought but rather the most tractable to address with 
process-based models at this time. The future integration of other phys- 
iological thresholds, as well as interactions with pests and pathogens, 
is the natural course along which we should proceed. 
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PIK3CA-related overgrowth syndrome 
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CLOVES syndrome (congenital lipomatous overgrowth, vascular malformations, epidermal naevi, scoliosis/skeletal 
and spinal syndrome) is a genetic disorder that results from somatic, mosaic gain-of-function mutations of the PIK3CA 
gene, and belongs to the spectrum of PIK3CA-related overgrowth syndromes (PROS). This rare condition has no 
specific treatment and a poor survival rate. Here, we describe a postnatal mouse model of PROS/CLOVES that partially 
recapitulates the human disease, and demonstrate the efficacy of BYL719, an inhibitor of PIK3CA, in preventing and 
improving organ dysfunction. On the basis of these results, we used BYL719 to treat nineteen patients with PROS. The drug 
improved the disease symptoms in all patients. Previously intractable vascular tumours became smaller, congestive heart 
failure was improved, hemihypertrophy was reduced, and scoliosis was attenuated. The treatment was not associated 
with any substantial side effects. In conclusion, this study provides the first direct evidence supporting PIK3CA inhibition 


as a promising therapeutic strategy in patients with PROS. 


The phosphoinositide-3 kinases (PI(3)Ks) are key lipid kinases that 
control signalling pathways involved in cell proliferation, motility, 
survival and metabolism!. Class I PI(3)K contains four cata- 
lytic isoforms (p110a, p1108, p1106 and p110-+), which carry out 
non-redundant signalling functions!. PIK3CA encodes the 110-kD 
catalytic a-subunit of PI(3)K (p110a), which converts phosphatidy- 
linositol (3,4)-bisphosphate (PtdIns(3,4)P2) to phosphatidylinositol 
(3,4,5)-triphosphate (PtdIns(3,4,5)P3). This leads to the activation 
of PDK1, which phosphorylates AKT on Thr3087?. AKT, through 
the phosphorylation and inhibition of tuberous sclerosis complex 2 
(TSC2), activates the kinase mechanistic target of rapamycin complex 1 
(mTORC1)*. PIK3CA is a master regulator of cell growth and other 
pathways, including the Rho/Rac] signalling cascade. PROS are caused 
by post-zygotic mosaic gain-of-function mutations in the PIK3CA 
gene” , resulting in mosaic PI3KCA activation, which drives various 
patient phenotypes, ranging from isolated macrodactyly to CLOVES 
syndrome’. Notably, an increasing number of phenotypes has been 
included in PROS asa result of the identification of PIK3CA mutations 
in previously uncharacterized overgrowth syndromes. Thus, the preva- 
lence of PROS is difficult to estimate owing to the variability of correct 
clinical diagnoses and the broad phenotypic spectrum of the disorders’. 

There are no animal models of PROS that can be used to understand 
its physiopathology, and no specific treatments for patients®”. Patients 
with PROS mainly receive supportive care based on debulking and 


mutilating surgery, sclerotherapy, and psychological and nutritional 
support. Inhibition of mTORC seems to improve lymphatic malfor- 
mations in some patients but is associated with many side effects**. 
Several PIKC3A inhibitors are under development as treatments for 
oncological conditions, where gain-of-function mutations in PIK3CA 
are observed’. Among them, BYL719 shows dose- and time-dependent 
inhibition of the PISK/AKT pathway in PIK3CA-dependent xenograft 
tumour models'™!!, As BYL719 is currently under investigation in clin- 
ical trials in patients with PIK3CA-dependent tumours and has good 
tolerability’’, we decided to explore its therapeutic potential in PROS. 


A mouse model of PROS/CLOVES 

We started by developing a mouse model of PROS/CLOVES. 
To this end, we took advantage of the transgenic mouse strain, 
R26StopFLP11 0*!34 which, after breeding with Cre recombinase mice, 
express a dominant active PI3KCA transgene (Extended Data Fig. 1a, b). 
R26StopFLP110* mice were crossed with CAGG-CreER mice’* to 
generate PIK3CA™40@-C® animals that ubiquitously express PIK3CA 
upon tamoxifen administration, and EGFP to monitor Cre recombina- 
tion. Three-week-old mice were treated with a single dose of 40 mg kg! 
tamoxifen to induce Cre recombination’. EGFP staining revealed that 
Cre recombination occurred in 25.5 + 8.3% of splenocytes (Extended 
Data Fig. 1c). After tamoxifen administration, we observed that the 
PIK3CACACG-CrER mice started to rapidly die with 50% of mortality at 


INSERM U1151, Institut Necker Enfants Malades, Paris, France. 2Université Paris Descartes, Sor' 
Malades, AP-HP, Paris, France. “Service de Dermatologie Pédiatrique, Hopital Necker-Enfants Ma 


onne Paris Cité, Paris, France. Service de Chirurgie Viscérale Pédiatrique, Hopital Necker-Enfants 
ades, AP-HP, Paris, France. °>UMR-1163 Institut Imagine, Hopital Necker-Enfants Malades, AP-HP, 


Paris, France. "Département de Radiologie Pédiatrique, Hdpital Necker-Enfants Malades, AP-HP, Paris, France. 7Département d’Anatomopathologie, Hépital Necker-Enfants Malades, AP-HP, Paris, 


France. ®Département de Médecine Nucléaire, Hépital Marie Lannelongue, Le Plessis Robinsson, 
WA, USA. !°Service d’Orthopédie Pédiatrique, Hopital Necker-Enfants Malades, AP-HP, Paris, Fran 
Paris, France. !2Service de Néphrologie, Transplantation, Dialyse, Aphéréses, Centre Hospitalier U 


France. Department of Integrative Brain Research, Seattle Children’s Research Institute, Seattle, 
ce. Service de Néphrologie Transplantation Adultes, Hopital Necker-Enfants Malades, AP-HP, 
niversitaire Pellegrin, Bordeaux, France. !73UMR CNRS 5164, Immuno ConcEpT, CNRS, Bordeaux, 


France. !4Service d'lmagerie Diagnostique et Interventionnelle de |’Adulte, Centre Hospitalier Universitaire Pellegrin, Bordeaux, France. !Service de Génétique Médicale, Hopital Necker-Enfants 
Malades, AP-HP, Paris, France. !®Service de Chirurgie Maxillo-faciale et Chirurgie Plastique, Hopital Necker-Enfants Malades, AP-HP, Paris, France. !’Laboratory EA 2496 Orofacial Pathologies, 


Imaging and Biotherapies, Montrouge, France. ‘Service de Neuropédiatrie, Hopital de la Mére et 


de Enfant, Limoges, France. !°Service d'Imagerie Pédiatrique, Hépital Femme-Mére-Enfant, Bron, 


France. 2°Pharmacie, Hopital Necker-Enfants Malades, AP-HP, Paris, France. 2!These authors con 


ributed equally: Thomas Blanc, Smail Hadj Rabia. “e-mail: guillaume.canaud@inserm.fr 


NATUR E|www.nature.com/nature 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


© pixscat 4 weeks 
BYL719 


B_| 
> POF 
PIK3CACAGG-CreER Vehicle 


Dp. 


re zy 
> 8 
o 


— PIK3CAWT 


Tamoxifen e PIK3CACAGG-CreeR 


PIK3CA“T 


Per cent survival 


P< 0.0001 2 PO _y 


0 15 30 45 60 75 


* Vehicle 


y. 
£ 
% MRI 
E 
oO 


PIK3CACAGG-CreER 


MRI 
£ 


c PIK3CACAGG-CreER f 
BYL719 preventive 


— PIK3CACAGS-CreER + vehicle 
== PIK3CACAGG-CreER 
BYL719 preventive 
Tamoxifen Withdrawal 


v BYL719 y 


——e 1 


PIK3CAWT Vehicle BYL719 therapeutic 


PAS 

Jaa 
3 
Oo 


5 
' 
' 
5 

1 


Per cent survival 


1 
P< 0.0001'; 


0 15 30 45 60 75 
Days 


HE 
uaa\ds 


g — PIK3CACAGS-CreER + vehicle 
== PIK3CACAGG-CreER 
BYL719 therapeutic 
Tamoxifen 


100 ¥ 


Trichrome 
Aeuply 


BYL719, 


Trichrome 
Per cent survival 


P <0.0001 


0 10 20 30 40 50 60 70 
Days 


d PIK3CACAGG-CreER 


PIK3CA“T Vehicle 


7; 
or > re ee — 60 kDa 
> = BR mt TT - 2000 


B-actin —p> qe seme em Sm mre ee eee ee — 42 KD 


BYL719 preventive BYL719 therapeutic 


P-AKT(Ser43) —p 
P-AKT(Thr208) p> 
P-SCRP p> 


wwe eee — 60KkDa 


Jeary 


P-AKT(Ser473) —p ——_—— ee — 60 kDa 
P-AKT(Thr208) —p _—_——_— wwe — 60kDa | F 
PSP — — ——$- ee = 36 kDa | + 


_—-_— = 
pati re cee cee a — 49 kDa 


P-AKT(Ser’3) —p —— 
P-AKT(Thr208) p> 
P-S6RP —p 


B-aCtin —> we ee ee ee SE re ee cee eee ee — 42 kDa 


—————— 60 kDa 


— = —60kDa 


9josnw 


om ee — 86 kDa 


Fig. 1 | Characterization of the PROS mouse model and efficacy of 
BYL719. a, Kaplan-Meier survival curves of PIK3CA™" and 
PIK3CA@4C@-CreER mice (n= 16 mice per group) after tamoxifen 
administration (log-rank test, P< 0.0001). b, Representative necropsy 
examination pictures of PIK3CA©46C-C"-® mice. The mice displayed 
sudden intra-abdominal and spontaneous hepatic haemorrhage (n = 16 
mice). c, Morphology of livers (top), spleens (middle) and kidneys 
(bottom two rows) from PIK3CA™! and PIK3CA©4°¢-C”ER mice that were 
treated with or without BYL719 directly after Cre induction (preventive) 
or seven days later (therapeutic) (n = 8 mice per group). d, Western blot 


day 9 (Fig. 1a). Death occurred suddenly in most cases, with necropsy 
revealing intra-abdominal and hepatic haemorrhages (Fig. 1b). Whole 
body magnetic resonance imaging (MRI) showed scoliosis, vessel 
abnormalities, kidney cysts, and muscle hypertrophy (Extended Data 
Fig. le, left and middle columns). Histological examination revealed 
multiple organ abnormalities, including severe liver steatosis with vessel 
disorganization, loss of spleen microarchitecture integrity, spontaneous 
haemorrhages and fibrosis of the kidney with aberrant vessels (Fig. 1c, 
Extended Data Fig. 2a, b). We observed a high number of proliferat- 
ing cells in all the affected organs of the PIK3CA™CC- CER mice, as 
assed by Ki67 immunostaining (Extended Data Fig. 3a). By contrast, 
we failed to detect any change in apoptosis (Extended Data Fig. 3b) 
or senescence (Extended Data Fig. 4a, b). To further characterize the 
previously reported vessel malformations, we performed CD34 and 
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of P-AKT (Ser*”3), P-AKT (Thr*”’) and P-S6RP in liver, heart and muscles, 
respectively, from PIK3CA™? and PIK3CAC46@-CER mice treated with 

or without BYL719 directly after Cre induction (preventive) or seven 

days later (therapeutic) (n =8 mice per group). e, Experimental design. 

f, Kaplan-Meier survival curves of PIK3CAC46@-CER mice treated with 

or without BYL719 after tamoxifen administration (m= 16 mice per 
group). After 40 days, BYL719 treatment was withdrawn (log-rank test, 
P<0.0001). g, Kaplan-Meier survival curves of PIK3CAC46@-C"ER mice 
treated with or without BYL719 (n= 12 mice per group) seven days after 
tamoxifen administration (log-rank test, P< 0.0001). Scale bars, 20 1m. 


CD31 immunostaining, which confirmed the presence of severe vessel 
dilation (Extended Data Fig. 2c). Similarly, podoplanin and LYVE-1 
confirmed expansion of lymphatic vessels (Extended Data Fig. 2d, e). 
We confirmed that mutant p110ca (p110*) was expressed in all affected 
organs (Extended Data Fig. 5a). Notably, we did not detect expres- 
sion in the brain or lungs (Extended Data Fig. 5b), consistent with the 
normal histology of these organs (data not shown). Western blot and 
immunofluorescence studies showed AKT/mTORC pathway activation 
in all the examined organs (Fig. 1d and Extended Data Figs. 5c-e, 6). 
In transfected HeLa cells, the p110* mutant had a stronger effect on 
AKT/mTORC pathway activation than did the c.3140A>G (H1047R) 
and c.1633G>A (E545K) mutations from patients used in previously 
reported models!®!” (Extended Data Fig. 7), which may explain the 
increased phenotypical severity observed in our model. 
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BYL719 prevents the PROS/CLOVES phenotype 

We next studied the effect of BYL719 on PIK3CA™4°@-CER mice. 
Initially, BYL719 was administered orally each day starting on the day of 
Cre induction (Fig. le). Whereas all placebo-treated PIK3CACAGG-CreER 
mice died within 15 days, all BYL719-treated PIK3CACASC-CER mice 
were alive after 40 days and had an overtly normal appearance (Fig. 1f). 
Histological examination at 40 days showed that the BYL719 treated 
mice had preserved tissues (Fig. 1c and Extended Data Fig. 2a) and 
normal vessels (Extended Data Fig. 2c). Mechanistically, we observed 
a strong reduction of proliferation in PIK3CA“°° C® mice treated 
with BYL719 (Extended Data Fig. 3a), whereas senescence and apop- 
tosis did not change (Extended Data Figs. 3b, 4a, b). Western blot and 
immunofluorescence analyses confirmed that PISKCA pathway activa- 
tion was effectively inhibited (Fig. 1d and Extended Data Fig. 5c-e, 6). 
However, interruption of treatment 40 days after Cre recombination 
led to the rapid death of all PIK3CA@46C- CER mice (median survival 
9.8 days after withdrawal of the drug; Fig. 1f). 

Next, we administered either placebo or BYL719 to PIK3CACAGG-CreER 
mice seven days after Cre induction (Fig. le), when tissue abnormalities 
were already detected by MRI (Extended Data Fig. le). BYL719 treat- 
ment improved the survival of the PIK3CA“40¢ ©" mice (Fig. 1g). 
MRI performed 12 days after the start of treatment (19 days after Cre 
induction) showed improvements in scoliosis, muscle hypertrophy, and 
vessel malformations (Extended Data Fig. le). Histological analysis of 
BYL719-treated mice revealed only minor tissue changes compared to 
wild-type mice (Fig. 1c, Extended Data Fig. 2a). BYL719 administration 
strongly reduced cell proliferation in all affected organs (Extended Data 
Fig. 3a), and western blot and immunofluorescence analyses confirmed 
PIK3CA pathway inhibition (Fig. 1d, Extended Data Figs. 5c-e, 6). 

To try to reproduce more faithfully the lower mosaicism observed 
in patients’®, we used a single dose of 4 mg kg~! tamoxifen to induce 
Cre recombination. We confirmed that a lower dose of tamoxifen 
induced a lower rate of mosaicism by fluorescence-activated cell sort- 
ing (FACS; 4.1 + 1.3%, Extended Data Fig. 1d). The mice survived for 
two months and then died with multiple phenotypic abnormalities. 
In addition to organomegaly’’, these mice progressively developed 
asymmetrical overgrowth of extremities, disseminated voluminous 
tumours and visible subcutaneous vascular abnormalities (Fig. 2a). 
Histological examination of the tumours revealed the same lesions 
observed in patients with PROS (lipomatous tumours (Fig. 2b) and 
severe vascular lesions mixing venous and arterial vessels (Fig. 2c-e). 
More precisely, we observed ecstatic venous channels with a thin 
endothelial cell lining, surrounded by sparse, erratically distributed 
vascular smooth muscle cells and a disorganized extracellular matrix, 
as is seen in PROS/CLOVES (Fig. 2e). Immunofluorescence and 
immunochemistry studies confirmed the increase in cell prolifera- 
tion and activation of the AKT/mTORC pathway (Fig. 2f, Extended 
Data Fig. 7b). 

Once the lesions were clinically visible, we treated the low-dose 
PIK3CA@SG-CrER mice with BYL719. Remarkably, the treatment 
led to the reduction and disappearance of all visible tumours within 
two weeks, with body weight loss (Fig. 2g, Extended Data Fig. 8a). 
Computerized tomography (CT) scanning showed a marked and rapid 
reduction in tumour volume (—83.96 + 2.81%), with no change in fat 
content (Extended Data Fig. 8b). 

Notably, withdrawal of BYL719 led to the recurrence of tumours 
and the development of asymmetric extremity hypertrophy within four 
weeks (Fig. 2h, Extended Data Fig. 7c). These data suggest that continu- 
ous administration of BYL719 can relieve PROS symptoms at different 
stages by inhibiting hyperactive PI3KCA signalling. 


BYL719 is more effective than rapamycin 

As the mTOR inhibitor rapamycin has shown some efficacy in treat- 
ing vascular malformation”, we tested its efficacy in our mouse 
models. In the first PIK3CA@46¢-E mouse model, which received 
40mg kg~’ tamoxifen, rapamycin improved survival (Extended Data 
Fig. 9a). However, it did not improve organ abnormalities such as liver 
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and spleen disorganization with aberrant vascularization (Extended 
Data Fig. 9b, c). Western blot analysis revealed that rapamycin blocked 
mTORCI1 activity but, as previously reported”!, did not inhibit 
mTORC2 as assessed by the persistent phosphorylation of AKT on 
Ser473 in all the examined organs (Extended Data Fig. 9d). Similarly, 
in the PIK3CA™°®-C"ER mouse model that received 4 mg kg”! tamox- 
ifen, rapamycin treatment did not reduce tumour growth significantly 
(Extended Data Fig. 9e). In parallel to these studies, we exposed 
primary mouse fibroblasts from PROS mice to either rapamycin or 
BYL719. Mutant p110* overexpression led to the phosphorylation of 
AKT on both Ser473 and Thr308 as well as the phosphorylation of 
S6RP (Extended Data Fig. 10a, b). By contrast, we could not detect 
any activation of the p38 and p44/42 MAP kinase pathway. Notably, 
unlike rapamycin, BYL719 blocked AKT phosphorylation on both 
Thr308 and Ser473, suggesting that the beneficial effect of BYL719 
could be driven by more complete blockage of AKT (Extended Data 
Fig. 10a, b). These results confirmed that BYL719 could be a good 
therapeutic option even for patients for whom rapamycin is not an 
effective option. 


BYL719 in two patients with CLOVES syndrome 

We administered BYL719 under compassionate care protocols to two 
patients, one adult and one child, who were suffering from extremely 
severe clinical manifestations of PROS/CLOVES syndrome with ther- 
apeutic failure and life-threatening complications. Patient 1 was a 
29-year-old man with PIK3CA c.3140A>G (H1047R) mutation (11% 
mosaicism). He presented left leg hypertrophy, scoliosis, multiple 
naevi, and extremely severe vascular malformations (Fig. 3a—c). He 
had previously undergone multiple tumour debulking surgeries and 
angiographies. He became paraplegic at the age of 20 owing to spinal 
cord compression and required bladder stenting. He developed pro- 
gressively worsening, drug-resistant, severe systolic heart failure with a 
measured cardiac output of 181 min~! m~? (normal < 3.51 min7! m7”). 
Consistently, the brain natriuretic peptide (BNP) level was over 
2,500 pg ml“! (normal < 100 pg ml7') (Fig. 3d). Over five years, the 
patient received different doses of rapamycin” to limit vascular tumour 
growth progression, but this had no overt effects on his abdominal and 
dorsal vascular tumours. Finally, the patient developed kidney dys- 
function with severe proteinuria. A kidney biopsy revealed glomerular 
lesions with extensive fibrosis. Such lesions may be the result of heart 
dysfunction, the administration of rapamycin>*, and/or PIK3CA muta- 
tions in the kidney epithelial cells. CT scan (Fig. 3b) and MRI (Fig. 3e) 
analyses showed severe vascular malformations, while positron emis- 
sion tomography (PET) scan analysis was negative. Karnofsky and 
Eastern cooperative oncology group (ECOG) scores, which indicate 
general health status*>*4, were 20% and 4, respectively. Owing to the 
severity of the case and poor prognosis, the physicians, surgeons, and 
radiologists decided to stop any interventional treatment and to provide 
the patient with only supportive and palliative care. His estimated life 
expectancy was less than a few months. One month after rapamycin 
washout, daily oral BYL719 was started at 250 mg per day, the low- 
est dose used in clinical trials. We observed a rapid improvement in 
the general status of the patient, as well as a progressive reduction in 
tumour size, venous dilations and skin lesions (Fig. 3a). He lost 25 kg 
in 18 months due to oedema clearance but also to a major reduction 
in vascular tumour abnormalities (Fig. 3c). His thorax and abdomen 
circumferences were reduced by 25% and 39%, respectively, over 
18 months (Fig. 3c). CT scan and MRI analyses confirmed global 
vascular tumour shrinkage (72% volume decrease at 18 months), as well 
as disappearance of subcutaneous infiltration (Fig. 3b, e). The effect of 
the drug on heart function was remarkable, with complete correction 
of BNP levels within 4 weeks (Fig. 3d). Cardiac output decreased to 
31 min7' m~, and his heart size reduced by 25% (Fig. 3b). The left 
ventricular mass index to body surface area decreased from 250g m~* 
to 148g m7’. Renal function also improved rapidly, with estimated 
glomerular filtration rate increasing from 33 to 52 ml min“ per 1.73 m? 
(Fig. 3d). The hypertrophy of the left leg was also reduced (Fig. 3a). We 
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Fig. 2 | Characterization of the second mouse model of PROS and 
efficacy of BYL719. a, Morphology of PIK3CA@4C¢-C"ER mice one month 
after Cre induction with a single dose of 4mg kg~' tamoxifen. The mice 
displayed multiple visible deforming tumours across the whole body (red 
arrows). Some of these tumours had huge vascular irregularities. The mice 
progressively developed hypertrophic extremities, enlarged liver with 
aberrant vessels, and multiple intraabdominal vascular malformations 
(pictures representative of nine mice). b-d, Histopathological examination 
of the tumours revealed the presence of lipomatous tumours (b; n = 10 
mice examined) and multiple venous malformations composed of ecstatic 
venous channels with a thin endothelial cell lining, surrounded by sparse, 
erratically distributed vascular smooth muscle cells and a disorganized 
extracellular matrix (c, d; n = 10 mice examined). Scale bars, 501m. 

e, Coimmunostaining of CD34 and a-smooth muscle cells in venous 
malformation (n= 10 mice examined). f, Immunofluorescence staining 


observed a marked improvement in skin hypertrophy, with a change 
in nevus coloration and a reduction in ear size (Fig. 3a). Finally, after 
6 months of BYL719 treatment, the patient began to partially gain 
bladder function, with an improvement in saddle anaesthesia. MRI 
revealed a 60% reduction in the size of the venous malformation that 
was compressing his spinal cord. The patient’s Karnofsky and ECOG 
scores increased to 80% and 1, respectively. After eighteen months of 
treatment, the patient had no side effects except hyperglycaemia which 
was well controlled with nutrition therapy. BYL719 leads to peripheral 
insulin resistance’! and hyperglycaemia was one of the most common 
drug-related adverse events in the phase I trial’. The patient is 
currently still under BYL719 treatment. 
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for P-AKT (Ser*”?) and P-S6RP in the liver, subcutaneous tumour, ear 
tumour and kidneys of PIK3CA™? and PIK3CAC4G@-CER mice (n= 10 
mice examined per group). Scale bars, 10 jum. g, PIK3CA°46C-C"ER mice 
were injected with a single dose of 4mg kg! tamoxifen and followed 

for one month after Cre induction. Once the tumours reached a certain 
volume, the mice were treated with BYL719 for two weeks and this led 
to the disappearance of all tumours (n= 18 mice). h, PIK3CAC4C@-CreER 
mice were injected with a single dose of 4 mg kg~! tamoxifen and followed 
for one month after Cre induction. Once the tumours reached a certain 
volume, the mice were treated with BYL719 for two weeks and this led to 
the disappearance of all tumours (n =7 mice). Then, BYL719 treatment 
was withdrawal and we observed the recurrence of tumours and 
vascular malformations in PIK3CA©4°¢-CER mice within the next 

four weeks. 


Patient 2 was a nine-year-old girl with a PROS/CLOVES syndrome 
diagnosed two years before this study and a gain of function PIK3CA 
c.3140A >G (H1047R) mosaicism mutation (12% mosaicism) con- 
firmed by skin biopsy. She had scoliosis, left leg hypertrophy, vascular 
malformations, and hypertrophic back muscles (Fig. 4a). She also had 
a voluminous cystic lymphangioma involving the left kidney and the 
gastrointestinal tract. A PET scan showed metabolic fixation in the thy- 
mus, back muscles, and left leg muscles (Fig. 4d). Her Karnofsky and 
ECOG scores were 50% and 3, respectively. BYL719 was started at the 
lowest available dosage: 50 mg per day. As in the first patient, we observed 
a marked clinical improvement rapidly after the onset of treatment 
(Fig. 4a). The patient reported improved comfort and we observed a 
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Fig. 3 | Efficacy of BYL719 treatment in an adult patient with severe 


CLOVES syndrome. a, Representative pictures of patient 1 before and 
360 and 540 days after onset of BYL719 treatment. b, CT scans of patient 
1 before and 360 and 540 days after onset of BYL719 treatment. The 
magnifications are the same. c, Clinical parameters (weight, chest girth 
and waist circumference) of patient 1 before and after onset of BYL719 


24% reduction in her left leg volume, reduced left foot volume, and a 14% 
reduction in her abdominal circumference (Fig. 4b) after 12 months. 
The hypertrophic muscles in her back also shrank rapidly and, unexpect- 
edly, her scoliosis was reversed after 6 months without any other inter- 
vention (Fig. 4c). More importantly, MRI showed a 40%, 54% and 71% 
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Fig. 4 | Efficacy of BYL719 treatment in a child patient with severe 
CLOVES syndrome. a, Representative pictures of patient 2 before and 
180 and 360 days after onset of BYL719 treatment. b, Clinical parameters 
(thigh and waist circumferences) of patient 2 before and after onset of 
BYL719 treatment. c, Three-dimensional CT scan reconstruction of the 
spine before and 180 and 360 days after onset of BYL719 treatment. 
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treatment. d, Serum BNP and creatinine levels before and after onset of 
BYL719 treatment. e, Three-dimensional MRI-based reconstructions of 
the intra-abdominal vessels before and after onset of BYL719 treatment. 
The total volume of the venous malformation was reduced by up to 72% 
after 540 days of treatment. 


reduction in intra-abdominal tumour volume after 4, 6 and 12 months 
of treatment, respectively (Fig. 4e). A PET scan performed at the same 
time points confirmed the marked reduction in metabolic activity of the 
affected tissues (Fig. 4d). The patient's Karnofsky and ECOG scores were 
greatly improved, to 100% and 0, respectively. Her height had increased 


Day 360 


Day 180 


Day 360 


Day 180 Day 360 @ Dayo Day 120 Day 180 Day 360 
° * 
ry ‘y 
e 
? ° 


d, PET scan images of patient 2 before and 120, 180 and 360 days 

after onset of BYL719 treatment. Arrows indicate the hypermetabolic 
tissues before and during treatment. e, Three-dimensional MRI-based 
reconstruction of the intra-abdominal tumour before and 120, 180 and 
360 days after onset of BYL719 treatment. The tumour volume was 
reduced by up to 71% after 360 days of treatment. Scale bar, 1 cm. 
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Fig. 5 | Efficacy of BYL719 treatment in patients with PROS. Images 

of patients 1-9 before and after six months of BYL719 treatment. Patient 
1 is a four-year-old girl suffering from severe vascular malformations 
involving the right arm and chest with permanent pain. After six months 
of treatment we saw a marked improvement in all vascular malformations 
as well as the scoliosis. Patient 2 is a 4-year-old girl with scoliosis and 
hypertrophic left buttock who had already undergone left foot partial 
amputation. After six months of treatment we saw an improvement in the 
scoliosis and reduction of the hypertrophic lesion. Patient 3 is a 5-year- 
old girl with CLOVES syndrome and chronic gastrointestinal bleeding. 
After six months of treatment, chronic bleeding stopped. Patient 4 is 

a 5-year-old girl with left hemifacial hyperplasia that was progressing 
despite multiple surgeries. After six months of treatment we saw for the 
first time an improvement in the hypertrophy (not shown for reasons of 


by 0.6cm per month during the 6 months before BYL719 treatment 
began, and growth remained stable at 0.6cm per month during the 
12-month treatment period. Her glycaemia remained normal and her 
glycated haemoglobin (HbA 1c) level, which was 5.5% before BYL719 
treatment, was 5.1, 4.9 and 5.0% after 3, 6 and 12 months of treatment, 
respectively. The patient is still receiving BYL719 treatment. 


BYL719 in a cohort of PROS patients 
On the basis of these results, we were authorized to administer BYL719 
to 17 additional patients with PROS who had life-threatening com- 
plications and/or were scheduled for debulking surgery (Fig. 5 and 
Extended Data Fig. 11a). The demographical characteristics of the 
patients are listed in Supplementary Table 1. All patients had doc- 
umented gain-of-function mutations in the PIK3CA gene, with ten 
different mutations (Extended Data Fig. 11b and Supplementary 
Table 1). All patients underwent clinical examination, laboratory eval- 
uation, PET and MRI scans before BYL719 administration and 3 and 
6 months after. Fourteen children were started at the lowest available 
dosage, 50 mg per day and three adults at 250 mg per day. Among the 
17 patients, six had been diagnosed with CLOVES syndrome, two 
with megalencephaly-capillary malformation (MCAP), and nine with 
localized overgrowth syndrome (back, limbs, face, arms) including 
three patients with abdominal or chest vascular tumours. In addi- 
tion, each patient had peculiar clinical features, and twelve patients 
had undergone previous debulking surgery (Fig. 5 and Extended Data 
Fig. 11a). Eight patients had received previous rapamycin treatment 
for18 months without clinical or radiological improvement. 

After BYL719, all patients showed substantial clinical improve- 
ment (Supplementary Table 1). All patients described reduced 
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confidentiality). Patient 5 is a 6-year-old boy with CLOVES syndrome. The 
treatment led to a reduction in lipomatous tumours and scoliosis. Patient 
6 is a 10-year-old girl with CLOVES syndrome and a severe lipomatous 
tumour on her back. BYL719 led to a marked improvement in the 

scoliosis and shrinkage of the tumour. Patient 7 is an 11-year-old boy with 
CLOVES syndrome, chronic gastrointestinal bleeding and severe dyspnea. 
Treatment improved all symptoms and the bleeding stopped. Patient 

8 is an 11-year-old girl with CLOVES syndrome and severe dyspnea. 
Treatment improved the subcutaneous lipoma as well as dyspnea. Patient 
9 is a 13-year-old girl with CLOVES syndrome, splenomegaly and severe 
vascular malformations involving the left kidney and limbs. Vascular 
malformations were markedly improved and the size of the spleen reduced 


after treatment. 


tiredness. The mean circumference of the clinical target lesions 
decreased by 12.6 +3.8 and 16.3 +3.9% after 3 and 6 months of 


BYL719 treatment, respectively 
Extended Data Fig. 11c). All patie 


(Supplementary Table 2 and 
nts demonstrated an improve- 


ment in skin capillary abnormalities and naevi became thinner. The 


two opioid-dependent patients who 


were confined to bed could stop 


morphine, and after 2 months of treatment were able to walk without 
help; tiredness improved, and haematuria disappeared in patient 11. 
Chronic gastrointestinal bleeding stopped in three patients, associ- 
ated with the correction of disseminated intravascular coagulation 
in patients 7 and 9, which led to the cessation of heparin treatment. 
We also observed an improvement in chronic palpebral cellulitis in 
patient 14 and we were able to progressively taper down this patient’s 
steroid treatment. All patients demonstrated a clinical improvement 
in scoliosis and the surgical corset was removed from patient 6. In 
the two patients with MCAP (patients 13 and 14), we observed an 
improvement in cognitive function and behaviour. After 6 months 


of treatment, all patients were still 
performed. 

Introduction of BYL719 initially 
mum after two months of treatment ( 


alive and no surgery had been 


led to weight loss with a maxi- 
—1.7 43.1%). However, after five 


months of treatment, body weight came back to baseline and patients 
started to gain weight (Extended Data Fig. 11d). Notably, four obese 


patients showed a marked reductio 


n in body weight (—9.7 + 8.2%; 


Extended Data Fig. 1le). This reduction was not associated with a 
reduction in food intake. MRI demonstrated an notable reduction in 
subcutaneous fat in these patients. The growth of the children* was 
not affected by the drug during the six months of treatment (Extended 


Data Fig. 12a). 
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In addition to the clinical improvement, we observed a radiolog- 
ical response in all patients. After 90 and 180 days of therapy, the 
mean volume of the target lesions had decreased by 27.2 + 14.6 and 
37.8 + 16.3%, respectively (Extended Data Fig. 12b-e). All target 
lesions responded to treatment. Notably, in the two patients with 
MCAP, MRI revealed an improvement in cerebral perfusion (Extended 
Data Fig. 12f). Finally, 5 of the 17 patients had hypermetabolic activity 
as assessed by PET scan. Remarkably, after 90 days of treatment we 
observed a drastic reduction in the metabolic activity of affected tis- 
sues (Extended Data Fig. 12g). 

Although our follow-up period was short, we observed a low rate 
of side effects using this drug. Indeed, we noticed only discrete mouth 
ulcerations in three patients during one week that disappeared spon- 
taneously (grade I). BYL719 was not associated with organ toxicity 
during this period (as assessed by heart, liver, kidney function and 
blood testing). Transient hyperglycaemia occurred in one obese patient 
(patient 15). Hyperglycaemia was well controlled with diet restric- 
tion only. Patient 17 was diabetic before BYL719 introduction and 
needed to increase the posology of oral antidiabetic drugs. Glycated 
haemoglobin remained normal during treatment in all patients 
(data not shown). All of the patients are still receiving BYL719 
treatment. 


Discussion 

We have described a mouse model of PROS/CLOVES that mimics ele- 
ments of the human phenotype by allowing titration of mosaicism; we 
have shown that BYL719 treatment rescues the PROS phenotype in the 
mouse model more efficaciously than rapamycin; and we have demon- 
strated that BYL719 is clinically effective, with a promising safety pro- 
file, in patients with PROS regardless of the type of PIK3CA mutation. 
This study provides the first direct evidence that PIK3CA inhibition is 
a robust and effective therapeutic strategy that can markedly improve 
the outcome of PROS patients. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0217-9. 
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METHODS 
Mice. For this study, we interbred homozygous R26Stop*'P110* (Stock# 012343) 
and heterozygous CAGGCre-ER™ (Stock# 004682) mice on the C57BL/6 back- 
ground obtained from Jackson Laboratories. We obtained R26Stop™P110**/— 
x CAGGCre-ER™+ mice (referred to here as PIK3CA@46@-C"ER mice) and 
R26Stop™P110*+/+ x CAGGCre-ER™ mice (referred to here as PIK3CA“? 
mice). We used the p110* construct! (Extended Data Fig. 9). The p110* protein 
is a constitutively active chimaera that contains the iSH2 domain of p85 fused to 
the N terminus of p110 via a flexible glycine-linker. To generate tissue-specific 
p110*-transgenic mice, a cloned loxP-flanked neoR-stop cassette was inserted into 
a modified version of pROSA26-1 followed by the cDNA encoding p110* and 
then a frt-flanked IRES-EGFP cassette and a bovine polyadenylation sequence 
(R26StopFLP110*)}%. 

No statistical methods were used to predetermine sample size. The experiments 
were not randomized and the investigators were not blinded to allocation during 
experiments and outcome assessment. 

Animals were fed ad libitum and housed at a constant ambient temperature in 
a 12-h light cycle. Animal procedures were approved by the ‘Services Vétérinaires 
de la Préfecture de Police de Paris’ Departmental Director and by the ethical com- 
mittee of the Paris Descartes University. All appropriate procedures were followed 
to ensure animal welfare. In the first study, a single dose of tamoxifen (40 mg kg!) 
was administered by oral gavage when the mice were 21 days old. For the survival 
studies, the mice were followed daily after tamoxifen gavage (PIK3CA"? n= 16 and 
PIK3CACAGG-CreER  — 16). For the preventive studies, the mice were treated with 
BYL719 (MedChem Express; 50 mg kg”! in 0.5% carboxymethylcellulose (Sigma 
Aldrich), daily p.o.) (n= 16) or vehicle (0.5% carboxymethylcellulose (Sigma 
Aldrich), daily p.o.) (n= 16). For the therapeutic studies, the mice were treated 
with BYL719 (50 mg kg! in 0.5% carboxymethylcellulose (Sigma Aldrich), daily 
p.o.) (n= 12) or vehicle (0.5% carboxymethylcellulose (Sigma Aldrich), daily p.o.) 
(n= 12). Treatment was started at the time of tamoxifen gavage for the preventive 
study, or seven days later for the therapeutic study. 

In the second study, a single dose of tamoxifen (4mg kg ') was administered 
by oral gavage when the mice were 21 days old (PIK3CACAGG-CER yy — 28). Ten 
mice were killed at approximately 1 month after tamoxifen gavage when tumours 
reached a certain volume, for tissue examination. Once gross morphological 
abnormalities were visible, eighteen mice were treated with BYL719 for 15 days 
and treatment was stopped once macroscopic lesions disappeared to follow their 
phenotype. 

In the third study, a single dose of tamoxifen (40 mg kg!) was administered 
through oral gavage when the mice were 21 days old and the mice were then 
treated with daily intraperitoneal injection of rapamycin (MedChem Express, 
ref#HY- 10219) (PIK3CA@46¢-C"ER y — 7) or vehicle (PIK3CA“SO CER ny = 7) for 
30 days. Rapamycin was dissolved to a final concentration of 0.5mg ml! in abso- 
lute ethanol dissolved further in 5% polyethylene glycol (PEG-400) and 5% Tween 
80 in PBS and used for intraperitoneal injection at 4mg kg~!. All mice treated with 
rapamycin or vehicle were then killed for tissue examination. 

In the fourth study, a single dose of tamoxifen (4mg kg~!) was administered 

by oral gavage when the mice were at 21 days old (PIK3CAC46¢- CER n= 6). Once 
gross morphological abnormalities were visible (approximately 30 days later), all 
the mice were treated with daily intraperitoneal injection of rapamycin (MedChem 
Express, ref#HY-10219) for 30 days. Rapamycin was dissolved to a final concen- 
tration of 0.5mg ml"! in absolute ethanol dissolved further in 5% polyethylene 
glycol (PEG-400) and 5% Tween 80 in PBS and used for intraperitoneal injection 
at4mg kg! 
Cell cultures, plasmids and transfection. HeLa cells were grown at 37°C in 
Dulbecco's modified Eagle’s medium (DMEM) containing 10% fetal bovine serum 
(Sigma). Transient transfections were performed with cDNAs encoding wild-type 
PIK3CA (Addgene Inc., ID 16643), PIK3CA (Exon20 H1047R) (Addgene Inc., 
ID 16639) and PIK3CA (Exon9 E545K) (Addgene Inc., ID 16642) constructs and 
Myr-p110*-myc and Myr-p110*KR-myc constructs! using the Lipofectamine 2000 
transfection reagent following the manufacturer's instructions. Cells were lysed in 
RIPA buffer for protein extraction 48h after transfection and 12h after starvation. 
Each experiment was performed in duplicate and repeated at least four times. 

Skin samples were collected from the PIK3CA™? and PIK3CACAG@-CER mice 
using standard methods”. To generate dermal fibroblast cultures, skin sam- 
ples were minced and incubated at room temperature in 0.05% trypsin-EDTA 
(ThermoFisher) solution for 30 min with gentle shaking. Cells were collected by 
centrifuging at 700g for 10 min, resuspended in cell culture medium containing 
25% FBS, and plated onto 24-well plates to establish lines. Fibroblast cultures were 
grown and maintained in 1 x MEM (Corning) supplemented with 25% FBS and 
penicillin/streptomycin (Corning) to a final concentration of 100 IU penicillin 
and 500,1g ml~! streptomycin. 

Cells at similar population doubling were plated 1:4 from confluent cultures 
and allowed to grow until ~80% confluent. Then, medium was replaced with 


medium containing 4-OH tamoxifen (11M) (Sigma) for 48h to activate the 
Cre recombinase. Cells were then starved for 12h and exposed for 24h to either 
BYL719 (0.1 or 1.0j.mol1~!; MedChem Express) or rapamycin (10 or 100ng ml; 
Sigma) or an equal volume of DMSO. Cells were rinsed with 1x PBS. Protein 
lysates were collected by directly adding 1x RIPA buffer containing protease 
and phosphatase inhibitors. Each experiment was performed in duplicate and 
repeated at least four times 

Morphological analysis. Mouse tissues were fixed in 4% paraformaldehyde and 
paraffin embedded. 4-\1m liver sections were stained with periodic acid-Schiff 
(PAS), 4-|1m liver or spleen sections were stained with haematoxylin and eosin 
(HE), and 4-1m kidney sections were stained with Masson’s trichrome. Four-1m 
liver frozen sections were stained with Oil red O (Sigma Aldrich, ref# 00625). 
Immunohistochemistry and immunofluorescence. Paraffin-embedded kid- 
ney sections (41m) were incubated with anti-P-AKT (Ser*”) antibody (Cell 
Signaling Technology, ref#4051 dilution 1:100), anti-P-S6RP antibody (Cell 
Signaling Technology, ref# 5364, dilution 1:100), anti-a-smooth muscle cell anti- 
body (Sigma Aldrich, ref# A5228, dilution 1:100), anti-CD34 antibody (eBiosci- 
ence, ref# 14-0341, dilution 1:100), anti-CD31 antibody (Dianova, ref# Dia-310, 
dilution 1:30) or anti-podoplanin antibody (Agilent, ref#M3619, dilution 1:50). 
Immunofluorescence studies were analysed using a Zeiss LSM 700 confocal 
microscope. 

Proliferation assay. Proliferative cells were detected in tissues using Ki67 immu- 
nostaining (anti-Ki67 antibody, Thermo Fisher Scientific, ref# RM-9106-S1, dilu- 
tion 1:100). 4-1ym kidney sections were incubated with anti-Ki67 antibody, followed 
by a biotinylated mouse antibody (Vector) at 1:400 and a HRP-labelled streptavidin 
at 1:1,000. Staining was revealed by DAB. The proliferation index (PI) per tissue 
was calculated as the number of Ki67—positive nuclei for the total number of nuclei 
in 10 randomly selected fields (magnification 400). 

Apoptosis assay. Apoptosis was detected in 4-\1m sections of paraffin- 
embedded tissues (liver, spleen and heart) by TUNEL assay using the in situ Cell 
Death Detection kit (Roche) according to the manufacturer's protocol. The number 
of apoptotic cells was determined as the number of TUNEL-positive nuclei per 
field in 10 randomly selected fields (magnification x 400). 

8-galactosidase. Fresh tissues were briefly rinsed in PBS and then washed three 
times in PBS containing 5mM EGTA, 2mM MgCh, 0.02% nonidet and 0.01% 
Na desoxycholate. Samples were rinsed in PBS and then fixed in 20% formal- 
dehyde/2% glutaraldehyde for 45 min. Tissues were then immersed overnight 
in B-galactosidase staining solution. After rinsing in PBS, livers were fixed in 
4% formaldehyde and paraffin embedded. 4-\1m tissue sections were coun- 
terstained with haematoxylin and eosin. We then counted the number of cells 
with 6-galactosidase staining per field in 10 randomly selected fields 
(magnification x 400). 

mRNA analysis. mRNAs were quantified in mouse tissues by quantita- 
tive PCR with reverse transcriptase (RT-PCR) using an ABI PRISM 7700 
Sequence Detection system (Applied Biosystems). Primers were as fol- 
lows: p16 (fwd) 5’-GGCCAATCCCAAGAGCAGAG-3’ and (rev) 5’-GC 
CACATGCTAGACACGCTA-3’; RPL13 (fwd) 5’-CTCATCCTGTTCCC 
CAGGAA-3’ and (rev) 5‘-GGGTGGCCAGCTTAAGTTCTT-3’. RPL13 was used 
as the normalization control as previously described”®. 

Western blot. Western blots were performed as previously described”’. In brief, 
protein extracts from the liver, muscles, heart, kidneys and fibroblasts were resolved 
by SDS-PAGE before being transferred onto the appropriate membrane and incu- 
bated with anti-P-AKT (Ser*”*) antibody (Cell Signaling Technology, ref# 4060, 
dilution 1:1,000), anti-P-AKT (Thr*”) antibody (Cell Signaling Technology, ref# 
13038, dilution 1:1,000), anti- AKT antibody (Cell Signaling Technology, ref# 9272, 
dilution 1:1,000), anti-P-S6RP antibody (Cell Signaling Technology, ref# 5364, dilu- 
tion 1:1,000), anti-p110c (Cell Signaling Technology, ref# 4249, dilution 1:1,000), 
anti-LY VE-1 (R&D Systems, ref# AF2125, dilution 1:1,000), anti-P-p44/42 anti- 
body (Cell Signaling Technology, ref# 4370, dilution 1:1,000), anti-p44/42 (Thr20?/ 
Tyr?) antibody (Cell Signaling Technology, ref# 9102, dilution 1:1,000), anti- 
P-p38 (Thr'®°/Tyr!*) antibody (Cell Signaling Technology, ref# 9216, dilution 
1:1,000), anti-p38 antibody (Cell Signaling Technology, ref# 8690, dilution 1:1,000), 
anti-GFP antibody (Abcam, ref# ab13970, dilution 1:1,000), anti-GAPDH (Merck 
Millipore, ref#374, dilution 1:1,000) or anti--actin antibody (Sigma-Aldrich, 
ref#A2228, dilution 1:1,000), followed by the appropriate peroxidase-conjugated 
secondary antibody (dilution 1:10,000). Chemiluminescence was acquired using 
a Fusion FX7 camera (Vilbert Lourmat) and densitometry was performed using 
the BiolD software (Certain Tech). 

CT scan and MRI evaluation. High resolution micro-CT scans were performed 
at the Plateforme PIV, EA2496—Pathologie, Imagerie et Biothérapies orofaciales, 
Dental school Paris V. In brief, 4-week-old female PIK3CA" and PIK3CA©4G@-reER 
mice induced with 40 mg kg~! tamoxifen received either vehicle (n =3 
PIK3CA“? mice) or BYL719 (n=3 PIK3CA™" mice, n= 3 PIK3CACAC@reER 
mice) for 14 days. Additionally, PIK3CA“4SC CE mice (n= 3 mice) received 
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a single dose of 4mg kg ' tamoxifen one month earlier in order to induce 
the development of tumours and hypertrophic extremities. High-resolution 
micro-CT scans were performed under general anaesthesia before treatment 
and 7 and 14 days later. 

MRI scans were performed at the Plateforme IRM, INSERM U970, Centre de 
recherche cardiovasculaire de Paris. In brief, MRI scans were performed under 
general anaesthesia in 4-week-old female PIK3CA” (n =6) and PIK3CACAG¢-CreER 
(n=6) mice 7 days after they received a single dose of 40 mg kg! tamoxifen to 
induce Cre recombination. MRI scans were then repeated weekly in all mice for 
1 month. 

Flow cytometry. Spleens of PIK3CA“? (n= 12) mice, PIK3CACACCC"ER mice 
induced with 40 mg kg~! tamoxifen (n =6) and PIK3CA“4°¢-C"ER mice induced 
with 4 mg kg! tamoxifen (1 =6) were mechanically disrupted in PBS/SVF 2%. 
Following dissociation, spleens were filtered, centrifuged and resuspended. Cells 
were then treated with an FC blocker for 10 min at 4°C (Biolegend, ref# 101302) 
and fixed/permeabilized (BD Bioscience, ref# 554714). Cells were labelled with 
chicken polyclonal anti-GFP antibody (Abcam, ref# ab13970) for 30 min at 4°C. 
Subsequently, cells were incubated with Alexa Fluor 647-labelled goat anti-chicken 
IgY antibody (Abcam, ref# ab150171) for 30 min at 4°C. A background control 
incubated only with secondary antibody was also included. Samples were analysed 
using Gallios Flow Cytometer (Beckman Coulter) and FlowJo software (TreeStar). 
The mosaicism was assessed by the ratio of the number of GFP-positive cells to 
the total number of cells. 

Patients. The study was conducted on 19 patients (4 adults and 15 children) fol- 
lowed at Necker hospital. This protocol was approved by the ‘Agence Nationale 
de Sécurité du Médicament et des Produits de Santé (ANSM, authorizations 
n°: 553984-553986, 584018, 585881-586135, 585464, 585465, 585467, 585880, 
586136, 585463, 596229, 588018, 587904, 587896-587912, 587908, 587905, 587910, 
585458, 595374 and 587899) and informed written consent was obtained from the 
adult patients and the legal representatives of the children. BYL719 was compas- 
sionately offered by Novartis. Adult patients received 250 mg per day and child 
patients received 50 mg per day. BYL719 was taken orally every morning before 
breakfast. 
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Patients were followed at regular intervals: weekly for eight weeks, every two 
weeks for one month and then monthly. Glycaemia was evaluated daily for the first 
month and then progressively less frequently. At all time points, the patients had a 
physical examination and performance status measurement using the Karnofsky 
(on a scale from 0 to 100, with lower numbers indicating greater disability) and 
ECOG indexes (a scale of 0 to 5, with 0 indicating no symptoms and higher scores 
indicating increasing symptoms)**4. Growth of the children was monitored at 
all clinical appointments. Blood sampling (complete blood count, kidney and 
liver functions, glycated haemoglobin measurement) were performed at each 
time points. Glycaemia was monitored after all meals for two months and then 
the monitoring became progressively less frequent. Adverse events were graded 
according to National Cancer Institute Common Terminology Criteria for Adverse 
Events, version 4.0. All patients had heart ultrasound examination before BYL719 
treatment and then every three months. Magnetic resonance imaging studies were 
performed before BYL719 introduction and then after three and six months of 
treatment. PET scans were performed before and at 3 months of treatment. 

Data analysis and statistics. Data are expressed as mean +s.e.m. Survival curves 
were analysed with the Mantel-Cox (log-rank) test. Differences between the 
experimental groups were evaluated using ANOVA, followed when significant 
(P< 0.05) with the Tukey-Kramer test. When only two groups were compared, 
Mann-Whitney tests were used. Statistical analyses were performed using 
GraphPad Prism software. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. All data generated or analysed during this study are included in 
this published article (and its Supplementary Information Files). 
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Extended Data Fig. 1 | p110* construction and mouse model 
characterization. a, Left, Representation of p110 and iSH2 domain of 
the p85 subunit (striped bar). The iSH2 domain is important to stabilize 
the p110qa protein. The p110* protein is a constitutively active chimaera 
that contains the iSH2 domain of p85 fused to the N terminus of p110 
via a flexible glycine linker’ (right). b, To generate tissue-specific 
p110*-transgenic mice, a cloned loxP-flanked neoR-stop cassette was 
inserted into a modified version of pROSA26-1 followed by the cDNA 
encoding p110* and then a frt-flanked IRES-EGFP cassette and a 
bovine polyadenylation sequence (R26StopFLP110*)'%. c,d, EGFP 
expression from flow cytometry experiments in the spleen of PIK3CA“? 
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mice (n= 12) and PIK3CACACG-CreER mice injected with either a single 

40 mg kg“! dose (c; n= 6 mice) or a single 4mg kg‘ dose (d; n =6 mice) 
of tamoxifen. Each curve is a different mouse. e, MRI examination of the 
PROS mouse model and efficacy of BYL719 treatment. Top, arrows show 
muscle hypertrophy in PIK3CA@C¢- CER mice before BYL719 treatment. 
This phenotype was reversed by BYL719 administration. Middle, arrows 
show scoliosis in PIK3CA@4°°-C"E® mice before BYL719 treatment, which 
was rescued by BYL719 administration. Bottom, arrows show arterial 
dilation in PIK3CA™4°@-CER mice before BYL719 treatment, which was 
reversed by BYL719 administration (n = 6 mice per group). 
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Extended Data Fig. 2 | Quantification and vessel malformation. picture of lymphatic malformation as assessed by podoplanin 
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BYL719 treatment presenting organ abnormalities. b, Oil Red O staining (n=8 mice per group). Scale bars, 10 jm. e, Representative western 
of the livers of PIK3CA? and PIK3CAC46@-C"ER mice demonstrating blot of LYVE-1 in the liver of PIK3CA™? and PIK3CAC4G@-CER mice 
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PIK3CACACG-CER mice with or without BYL719 (n= 8 mice per group). test (two-tailed, P= 0.001). PIK3CA°4C¢-CER versus PIK3CA™’ mice, 
PIK3CACAGG-CreER mice treated with vehicle showed vessel dilation that ED < 0,001. 


was prevented or reversed by BYL719. Scale bars, 101m. d, Representative 
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group). A.U., arbitrary unit. All data are shown as mean +s.e.m. ANOVA 
followed by Tukey-Kramer test (two-tailed). 
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Extended Data Fig. 5 | p110* expression in affected tissues. a, Western 
blot showing the expression of p110* in PIK3CA@46¢- CER mice (n=8 
mice per group). b, p110* is not expressed in the brain or lungs (n=8 
mice per group). c—e, Western blot quantification of Fig. 1d, in the liver (c), 
heart (d) and muscle (e) of PIK3CA™" and PIK3CA“46¢-CER mice treated 
with or without BYL719 (n =8 mice per group). All data are shown as 


mean + s.e.m. ANOVA followed by Tukey-Kramer test (two-tailed). 
PIK3CACACG-CER versus PIK3CA"! mice, ***P < 0.001. PIK3CA@4C@-CreER 
mice treated with vehicle versus PIK3CA“46°-C”ER mice treated with 
preventive BYL719, ***P < 0.001. PIK3CA“46C- CE mice treated with 
vehicle versusd PIK3CA©4°¢- CER mice treated with therapeutic BYL719, 
+++P< 0.001. 
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Extended Data Fig. 6 | Ability of BYL719 to inhibit PIK3CA activation and PIK3CA©46@-CreER mice treated with or without BYL719 (n= 8 mice 
in different tissues. Immunofluorescence staining of P-AKT (Ser*”*) and per group). Scale bars, 10,1m. 
P-S6RP in the liver (a), heart (b), spleen (c) and muscles (d) of PIK3CA“" 
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Extended Data Fig. 7 | See next page for caption. 
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Extended Data Fig. 7 | Recruitment of the AKT/mTORC pathway by 
the different forms of mutant p110. a, Western blot and quantification 
of p110, P-AKT (Ser‘”?), P-S6RP and GFP in HeLa cells transfected 

with plasmids containing cDNA encoding p110*, p110* kinase-dead 
mutant (p110* KD) as a control, H1047R mutation or E545K mutation. 
Cells transfected with the p110* mutant showed a more powerful effect 
on the activation of the AKT/mTORC pathway than the others (n = 4 
independent experiments). All data are shown as mean +s.e.m. ANOVA 
followed by Tukey-Kramer test (two-tailed). p110* versus H1047R 
mutation, ***P < 0,001. p110* versus E545K mutation, ***P < 0.001. 


p110* versus wild-type p110, ***+P < 0.001. p110* versus p110* KD, 

$88P < 0.001. Negative control is a vector that contains cDNA encoding 
GFP. b, Histological examination of different tissues from PIK3CACAGG-CreER 
mice. Left column, from top to bottom, liver, abdominal tumour, leg and 
ear abnormalities. Middle column, PAS or HE staining of the tissue. Right 
column, Ki67 staining of the same tissue (n = 8 mice). Scale bars, 201m. 

c, Design of the experiment shown in Fig. 2h. PIK3CAC4C@- CER mice 
received a single dose of 4mg kg”! tamoxifen and were followed for one 
month. Once the tumours became visible, BYL719 was started for two 
weeks and then withdrawn. 
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Extended Data Fig. 8 | CT scan evaluation of the tumours and adipose 
tissue before and after BYL719 introduction. a, Body weight evolution 
of PIK3CA“? and PIK3CA“S¢-C"ER mice treated with vehicle or BYL719 
(n=3 mice per group). b, CT scan evaluation and quantification of the 
fat tissue content in PIK3CA™? and PIK3CA“46C-C"ER mice treated with 
vehicle or BYL719. Subcutaneous and visceral fat content were measured 


PIK3CA CAGG-CreER 


Before BYL719 BYL719 during 7 days BYL719 during 14 days 


Tumors volume change (%) 


0 7 14 
Time (days) 


before treatment and 7 and 14 days after onset of treatment with vehicle or 
BYL719 (n=3 mice per group). c, CT scan evaluation and quantification 
of tumour volume in PIK3CA“C@- CER mice before and after two weeks of 
BYL719 treatment (arrows) (n =3 mice per group). All data are shown as 
mean +s.e.m. 
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PIK3CA“G6@-CrER mouse models. a, Kaplan-Meier survival curves mice treated with vehicle or rapamycin directly after Cre induction. 
of PIK3CACAC6G-CreER mice that received a single dose of 40 mg kg~! e, PIK3CACAGG-CreER mice were treated with rapamycin one month after 
tamoxifen and were treated with or without rapamycin after tamoxifen Cre induction with a single dose of 4mg kg! tamoxifen and followed 


administration. b, Representative pictures of the liver of PIK3CACAGG-CreER for one month. All data are shown as mean +s.e.m. ANOVA followed 
mice treated with rapamycin 40 days after Cre induction. c, Morphology of — by Tukey-Kramer test (two-tailed). PIK3CAC4G@ CR versus PIK3CA“? 
livers and spleens from PIK3CA“? and PIK3CA“6°-C”E® mice that were mice, ***P < 0.001. PIK3CA™°¢- "ER mice treated with rapamycin 
treated with or without rapamycin after Cre induction. Scale bars, 10 1m. compared with PIK3CA@46¢-CreER mice treated with vehicle, ***P < 0.001. 
d, Western blot and quantification of P-AKT (Ser‘”*) and P-S6RP in the 
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Extended Data Fig. 10 | In vitro effect of BYL719 and rapamycin 
on fibroblasts from PIK3CA©46¢-CER mice. a, Skin fibroblasts from 
PIK3CA"? and PIK3CA46C-C'ER mice were isolated and exposed to 
vehicle or increasing concentrations of BYL719 or rapamycin for 24h. 
b, Quantification. White column, without 4-OHT; black column, with 


4-OHT. All data are shown as mean + s.e.m. ANOVA followed by Tukey- 
Kramer test (two-tailed). Before versus after Cre induction with 4-OHT, 
*** P< 0.001. BYL719 or rapamycin exposure compared with cells treated 


with vehicle, **P <0.01 and ***P < 0.001. 
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Extended Data Fig. 11 | Effect of BYL719 in patients with PROS. 

a, Patients 10-17 before and after 180 days of BYL719 treatment. Patient 
10 was a 14-year-old boy with severe asthenia, dyspnea and bilateral 
overgrowth of lower limbs. After 180 days of treatment asthenia resolved 
and we observed a marked reduction in hypertrophy of the limbs. Patient 
11 was a 14-year-old boy with overgrowth of the right buttock and an 
intra-abdominal vascular tumour infiltrating the left kidney and spinal 
nerve. He had chronic haematuria and was permanently confined to bed 
owing to pain. After 180 days, haematuria resolved and the volume of 
the intraabdominal vascular malformation was reduced by up to 68%. 
He had no more pain and became capable of walking. Patient 12 was a 
15-year-old boy with multiple large tumours of the trunk and the back. 
After 180 days of treatment the tumours had reduced in size. Patient 

13 was a 16-year-old boy with megalencephaly-capillary malformation 
(MCAP) and left hemifacial hyperplasia. Treatment led to a reduction 

in hemifacial hypertrophy and cognitive improvement. Owing to the 
deformation, this patient was not able to open the left eye. After 180 
days of BYL719 treatment, he was able to open the eye (not shown for 
confidentiality reasons). Patient 14 was a 16-year-old girl with MCAP and 
a chronic noninfectious palpebral cellulitis who was steroid-dependent. 
BYL719 treatment led to the healing of the cellulitis and steroids were 


ARTICLE 


stopped without a flare. We also observed enhancement of cognitive 
function and behaviour and improvement of scoliosis. Patient 15 was a 
19-year-old man with overgrowth of the left foot and unstable and painful 
walking. Treatment led to an improvement in the overgrowth as well as an 
improvement in walking distance. Patient 16 was a 32-year-old man with 
overgrowth of the right foot and unstable and painful walking. Treatment 
led to an improvement in the overgrowth as well as an improvement in 
walking distance. Patient 17 was 50-year-old woman with generalized 
hypertrophy, and severe and diffuse pain with opioid dependency. She was 
permanently confined to bed. After six months of treatment we observed 
an improvement in tiredness, and resolution of pain, and we were able 

to stop opioids within two weeks. The patient became able to walk again. 
b, PIK3CA mutations identified in the 17 patients. c, For each patient we 
determined a target lesion (see Supplementary Table 2) that was clinically 
measured at each time point. The graph represents the changes (per cent) 
during the 180 days of treatment with BYL719. Each line is a single patient. 
d, Mean body weight changes (per cent) during the 180 days of treatment 
with BYL719 (n= 13 patients, patients 1-13), excluding the four obese 
patients (patients 14, 15, 16 and 17). e, Mean body weight loss in the four 
obese patients during the 180 days of treatment with BYL719. All data are 
shown as mean + s.e.m. 
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Extended Data Fig. 12 | Height changes in children during treatment 
period and radiological changes with BYL719 treatment. a, Height 
changes in children during the 180 days of treatment with BYL719. b, MRI 
scans of patient 1 before and after 180 days of BYL719 treatment. Arrows 
show the target lesion. c, Three-dimensional MRI-based reconstruction 

of the chest tumour in patient 1 before and after 180 days of BYL719 
treatment. d, Examples of MRI showing the evolution of the target lesions 
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in patients 9 and 11. e, Volume evolution of the radiological target lesion 
after 180 days of BYL719 treatment. f, Diffusion MRI demonstrating the 
enhancement of brain perfusion in patient 14 after 180 days of BYL719. 

g, PET scan images of patients 6, 9, 15 and 17, before and after 90 days of 


BYL719 treatment. The arrows delineate hypermetabolic activity before 
and after 90 days of treatment. 
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Structure of the j:-opioid receptor-G; 


protein complex 


Antoine Koehl!’, Hongli Hu>??, Shoji Maeda?’, Yan Zhang!?, Qianhui Qu!, Joseph M. Paggi!?3.4, Naomi R. Latorracab?3+5, 
Daniel Hilger?, Roger Dawson®, Hugues Matile®, Gebhard F. X. Schertler”®, Sebastien Granier’, William I. Weis), 
Ron O. Dror!?345, Aashish Manglik!0!*, Georgios Skiniotis!?* & Brian K. Kobilka?* 


The j- opioid receptor (j:OR) is a G-protein-coupled receptor (GPCR) and the target of most clinically and recreationally 
used opioids. The induced positive effects of analgesia and euphoria are mediated by ,OR signalling through the adenylyl 
cyclase-inhibiting heterotrimeric G protein G;. Here we present the 3.5 A resolution cryo-electron microscopy structure 
of the »OR bound to the agonist peptide DAMGO and nucleotide-free G;. DAMGO occupies the morphinan ligand pocket, 
with its N terminus interacting with conserved receptor residues and its C terminus engaging regions important for 
opioid-ligand selectivity. Comparison of the ,,OR-G; complex to previously determined structures of other GPCRs bound 
to the stimulatory G protein G, reveals differences in the position of transmembrane receptor helix 6 and in the interactions 
between the G protein a-subunit and the receptor core. Together, these results shed light on the structural features that 


contribute to the G; protein-coupling specificity of the »OR. 


The OR is the primary target of morphine and many clinical opioid 
analgesics’. Binding of opioids to the ,,OR leads to clinically desired 
analgesic and antitussive actions, but also important negative side 
effects, including addiction and potentially lethal respiratory suppres- 
sion. Opioids have become the most prescribed class of medication in 
the United States”, which has led to a national epidemic of addiction 
and an unprecedented level of drug overdose deaths. 

Like other GPCRs, the OR achieves many of its physiological 
actions by stimulating signalling via a heterotrimeric G protein. While 
other GPCRs have been shown to signal through more than one 
G-protein subtype, the OR signals almost exclusively through the 
adenylyl cyclase-inhibitory family of G proteins (Gi.)*. The analge- 
sic activity of opioids is driven by G-protein activation’, but activated 
OR can also interact with B-arrestins, recruitment of which has been 
associated with the respiratory depression induced by many opioids™*. 
Recently developed molecules that favor G; signalling over arrestin 
recruitment display analgesic efficacy with reduced side effects, sug- 
gesting that different signalling pathways can be selectively targeted 
to yield unique physiological outcomes”*. Although a framework for 
GPCR interactions with the stimulatory G protein G, has recently 
been enabled by X-ray crystallography” and cryo-electron microscopy 
(cryo-EM)'"' studies, the structural basis of GPCR signalling through 
other G-protein subtypes remains undefined. To better understand 
the mechanism of selective activation of G; by the OR, we sought to 
determine the structure of the 1»OR-G; complex. 


35h cryo-EM map of a »jOR-G; complex 

DAMGO (H-Tyr-p-Ala-Gly-N(Me) Phe-Gly-OH) is a pOR-selective 
synthetic analogue of the natural peptide agonist enkephalin. DAMGO- 
bound OR was incubated with G;; heterotrimer, and the complex 
was treated with the nucleotide hydrolase apyrase to remove GDP. 


The resulting nucleotide-free complex was further stabilized by a 
single-chain variable fragment (scFv16) that binds to heterotrimeric 
G; (Extended Data Fig. 1) and prevents GTP\S-mediated dissociation 
of nucleotide-free complexes. We applied single-particle cryo-EM to 
initially obtain a three-dimensional map of the j;OR-DAMGO-G;- 
scFv16 complex at an indicated nominal resolution of 3.6 A (Extended 
Data Figs. 2, 3, Extended Data Table 1). Notably, scFv16 binds a com- 
posite interface comprised of the aN helix of Ga; and the B-propeller 
of G®, a site that is more than 20 A distal to the wLOR-Ga; interface and 
does not perturb the interface between Ga and Gf subunits (Extended 
Data Fig. 1). Subtraction of the scFv16 signal from raw particle images 
led to an improved map with an indicated global resolution of 3.5 A. 
This map displayed enhanced features, particularly in the transmem- 
brane core of the receptor (Extended Data Figs. 2-4), which enabled 
the high-resolution visualization of the j,OR-G; interface and ligand 
binding. Accordingly, we employed this improved 3.5 A map to 
examine interactions between 1OR and DAMGO, and between jOR 
and G; (Fig. 1a, b). 


Activation of OR by a peptide agonist 

The active-state crystal structure of jOR bound to the morphinan 
agonist BU72 and an active-state stabilizing nanobody (Nb39) has 
been determined at a resolution of 2.2 A’’. Similar to other small 
molecule morphinans, BU72 is rigidified by a complex ring system; 
in contrast to flexible opioid peptides, such as DAMGO, that have 
multiple rotatable bonds. Our cryo-EM map includes well-defined 
features for most amino acids forming the orthosteric binding pocket 
(Extended Data Fig. 4a). Despite differences in agonist structure, the 
conformation of the active-state ligand-binding pocket and the ori- 
entation of the amino acids that interact with the agonist are highly 
similar between the 1OR bound to either BU72 or DAMGO (Fig. Ic), 
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Fig. 1 | Cryo-EM structure of the ,OR-G; complex. a, Orthogonal views 
of the cryo-EM density map of the 1,OR-G; heterotrimer complex coloured 
by subunit. Green, ,OR; orange, DAMGO; gold, Go; Ras-like domain; 
cyan, GB; purple, Gy. b, Model of the ,,OR-G; complex in the same views 
and colour scheme as shown in a. c, Residues that line the OR orthosteric 
binding pocket are shown as sticks for the ,OR-G; complex (green) and 
the pOR-Nb39 complex (PDB code 5C1M; blue). The binding pocket 
residues of OR in complex with DAMGO and BU72 show nearly identical 
conformations, despite differences in ligand structure. d, Comparison 


suggesting that the OR recognizes structurally distinct agonists in a 
stereotypic manner. 

Although DAMGO is a flexible ligand, we observe density for the 
entire peptide bound to the receptor (Fig. 1a, Extended Data Figs. 3, 4). 
The N terminus of DAMGO occupies a similar position in the binding 
pocket as BU72. By contrast, the C terminus of DAMGO extends ~8 A 
further towards the extracellular loops compared to BU72 (Fig. 1d, e). 
To identify stable atomic-level interactions between DAMGO and 
the binding pocket, we performed molecular dynamics simulations. 
In over 1 ps of simulation, DAMGO remained close to its initially 
modelled pose, with the N-terminal portion largely remaining con- 
fined to the experimentally determined cryo-EM density (Fig. 1f, 
Extended Data Fig 5). The N terminus of DAMGO maintained a 
persistent salt bridge with D147°*”, a feature previously observed in 
structures of morphinans bound to opioid receptors (Fig. le; super- 
scripts indicate Ballesteros—Weinstein numbering for GPCRs’). 
The same amine group also frequently formed a hydrogen bond with 
326’. More generally, the N-terminal Tyr of DAMGO overlaps 
the phenolic groups of other small-molecule opioids that have been 
characterized in complex with ,,OR or other opioid receptors by X-ray 
crystallography!*"!”. 

Molecular dynamics simulations also revealed a water-mediated 
hydrogen bonding network that closely overlaps with the water network 
observed in the high-resolution crystal structure of |,OR’? (Extended 
Data Fig. 6). In particular, the simulations revealed a stable, water- 
mediated interaction that is formed between the phenol of DAMGO 
and H297°*. Although the crystal structure of the tOR bound to 
BU72 shows two water molecules bridging the DAMGO phenol and 
H297°*?, simulations of tOR bound to DAMGO and other phenolic 
ligands*/” suggest that one of these water molecules rapidly dissoci- 
ates, and that a single water is required for stable ligand binding. This 
interaction is a hallmark of opioid recognition that has been observed 
for morphinans in complex with the pOR!*" as well as other small 
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of BU72 (yellow carbons) in the orthosteric pocket of the pOR-Nb39 
complex (blue) with DAMGO (orange carbons) in the orthosteric pocket 
of the »OR-G; complex (green). e, view of DAMGO in the orthosteric 
binding pocket with critical residues shown. f, A frame from every 100 ns 
of a 11s molecular dynamics simulation (yellow at t = 0, fading to red at 
t=1,s) shows that the first four residues of DAMGO (bottom) are stable, 
whereas the C-terminal Gly-ol (top) is dynamic but frequently returns to 
the modelled pose. 


molecule and peptide-mimetic agonists of the homologous 6- and 
k-opioid receptors (6OR and KOR)!*!%!8, 

DAMGO exhibits more than 500-fold selectivity for the pOR over 
the SOR and KOR”. Structural studies have shown that interactions of 
ligands with the extracellular loops encode ligand subtype specificity 
among closely related opioid receptors’*. Indeed, DAMGO selectivity 
for LOR over OR has been shown to depend on residues in extracellu- 
lar loop (ECL) 1, whereas selectivity over KOR results from differences 
in ECL3”°, The map density for the C-terminal residues of DAMGO is 
slightly weaker than for the amino terminus, consistent with increased 
mobility of this region in simulations (Fig. 1f, Extended Data Fig 5). In 
our model, the N(Me)Phe side chain of DAMGO occupies a conserved 
hydrophobic pocket near ECL1 and the Gly-OH group folds back over 
the ligand (Fig. le). This model is consistent with the high affinity of 
pOR binding to cyclized enkephalins, which bridge the +2 and +5 
positions of the peptide”!. 


Structure of G;-stabilized active pOR 

The overall structure of Gj-bound OR is similar to the active confor- 
mation of the BU72-bound OR stabilized by Nb39"? (root mean square 
deviation of 1 A) with a predominant outward displacement of trans- 
membrane helix (TM) 6 from the heptahelical bundle relative to the inac- 
tive state (Fig. 2a, b). A number of highly conserved residues in the GPCR 
family have been shown to be important for receptor activation, including 
the D2-9R3-50y3.51 N749p7-0X KX y753 and conserved core triad (Pr, p>.50, 
F**) motifs. The conformation of each of these regions in the |,OR-G; 
complex is virtually identical to the active state observed in the complex 
with Nb39 (Fig. 2c). The structural similarity of 1}OR between Nb39 and 
G,-bound states indicate that these changes underlie ligand-mediated 
activation and are not specific to a particular intracellular binder. Indeed, 
Nb39 and G; promote a similar increase in agonist affinity’’, supporting 
a common mechanism of allosteric communication between the intra- 
cellular G-protein-coupling domain and the ligand-binding pocket!” 
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Fig. 2 | Structural changes in the OR stabilized by nucleotide-free 

G;. a, Comparison of inactive 1OR (brown) and the G;-stabilized active 
state of tOR (green). b, Comparison of Nb39-stabilized and G,-stabilized 
active states of the ,,OR (blue and green, respectively). The structures are 
nearly identical except for a slight shift of TM6 towards TM7 in the G;- 
bound state. c, Residues important for activation of the pOR show nearly 
identical conformations despite the difference in ligands. d, Comparison 
of G,-stabilized 8,AR (orange) and G,-stabilized 1OR (green). While most 
transmembrane helices align well between the two receptors, TM6 is kinked 
further outward by 9 A in the B2AR; this distance is calculated between 
positions of C, of residue 6.29 (Ballesteros- Weinstein numbering) in TM6. 


Two differences between Nb39- and G;-stabilized active states of »OR 
are particularly notable. First, compared with the nanobody-stabilized 
active-state 1OR, TM6 in the tOR-G; complex is further displaced 
by 3 A towards TM7 (Fig. 2b). Second, the conformation of intracel- 
lular loop (ICL) 3 is different between the two structures (Fig. 2b). It 
is likely that the specific ICL3 conformation of Nb39-stabilized jOR 
reflects interactions that are unique to the nanobody rather than a 
general feature of receptor activation prior to G-protein coupling. A 
similar difference in ICL3 conformation was previously observed for 
the 8,-adrenergic receptor (8,AR) between nanobody” (Nb80) and G,- 
coupled states. The comparison of the G-protein-bound states of both 
receptors shows that TM6 of BAR is displaced outward by a further 
9Ain comparison to that of the »OR (Fig. 2d). 


Structural changes in G; 
The quality of the cryo-EM map enabled accurate modelling of G; in 
its nucleotide-free state, providing insight into the structural changes 


on | 


Fig. 3 | Changes in G; upon coupling to the OR. a, b, Comparison of 
GDP-bound Ga; (PDB code 1GP2, grey) and nucleotide-free Ga; from 
the ,,OR-G; complex (gold). GDP is shown as blue spheres in a and sticks 
in b and c. The primary differences between these two structures are the 
opening and outward movement of the alpha helical domain (AHD), and 
an upward shift of the «5-helix by 6 A to engage the receptor core. The 
a-carbons of the TCAT motif are represented as spheres in b. The TCAT 


C GDP-bound Go, 
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that underlie nucleotide release. The changes are similar to those seen 
in nucleotide-free G, in complex with other GPCRs. The most striking 
difference between the GDP-bound” and nucleotide-free heterotrimer 
in complex with OR involves the separation of the «-helical domain 
(AHD) from the Ras-like domain in Go, (Fig. 3a). Owing to its relative 
flexibility, we excluded the AHD density from the high-resolution map 
refinement. The dynamic character of the AHD has been observed pre- 
viously in spectroscopic and structural studies of complexes between 
receptors and G,”!! and G,*”°. Displacement of the AHD disrupts 
several contacts with GDP and is necessary, but not sufficient, for 
nucleotide release, a process that involves breaking additional contacts 
with the Ras domain”. 

Coupling of G; to the OR also involves a 6 A translation as well as a 
60° rotation of the a5-helix of Ga, into the receptor core (Fig. 3b). This 
movement has been shown to be essential for nucleotide release by G;*. 
In particular, the movement of «5 leads to a change in the position of 
the 86-a5 loop containing the conserved TCAT motif that forms direct 
interactions with the guanine base of GDP. This displacement disrupts 
key contacts between the G protein and nucleotide. Furthermore, the 
observed translation and rotation of the «5-helix requires the displace- 
ment of the fully conserved F336 away from the hydrophobic pocket 
formed by residues in the 82 and 83 strands and the «1 helix”® (Fig. 3b). 
Movement of the «5-helix is also propagated to the phosphate-binding 
P loop connecting the $1 strand and the al helix by disruption of 
a hydrophobic network between the a1 and a5 helices (Fig. 3b-d). 
Correspondingly, upon transition of G; to the nucleotide-free state, we 
observe a 4A shift of cl towards the «5-helix in G; whereby the hydro- 
phobic contacts are replaced by polar interactions with the 86-a5 loop 
as it is released from its guanine-binding position (Fig. 3c, d). These 
changes are in contrast to those observed in structures of Gs-coupled 
complexes, in which a1 not only becomes more unstructured, but also 
tends to lose interactions with the a5-helix (Fig. 3e). Our structure is 
consistent with previous studies suggesting that engagement of aGPCR 
with the a5-helix and aN-{1 loop leads to concerted changes in the a1 
helix and P loop that destabilize contacts with the guanine nucleotide, 
leading to its release’’. 


Structural insights into G;-coupling specificity of the »OR 
Although the OR couples exclusively to Gi.°, many GPCRs can 
couple to multiple G-protein subtypes. A well-studied example is the 
B.AR, which couples to both G, and Gi. Previous sequence-level anal- 
yses have failed to identify a linear GPCR epitope that determines 
G-protein-coupling specificity, suggesting that specificity is likely 
to be determined by a more complex three-dimensional network 
of interactions. Globally, the structure of the pOR-G; complex is 
similar to that of the B2.AR-G, complex; this is likely to reflect a 
similarity in the conformation of nucleotide-free states of family 
A GPCR-G-protein complexes. The primary interaction sites in 


© Nucleotide-free Go, 
1372 


a5 


©) 


motif coordinates the guanine base of GDP. The upward shift of the 
a5-helix and repositioning of the TCAT motif leads to nucleotide release. 
c, d, e, The interface between the a1 helix and the N-terminal end of the 
a5-helix and TCAT motif for GDP-bound Gq, (c), nucleotide-free Ga; (d) and 
nucleotide-free G, from the 82AR-G, complex (e). The upward movement 
of the a5-helix disrupts the interaction between the a1- and a5-helices, 
leading to changes in the P loop that coordinates the phosphates of GDP. 
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B,AR-Go., 
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Fig. 4 | Comparison of the receptor-G-protein binding interfaces of the 
pOR-G; and 8,AR-G, complexes. a, Comparison of the conformation of 
the «5-helix of Ga and receptor TM6 in B,AR-G, and tOR-G; complexes 
after alignment on the receptor. b, Interactions between ICL2 of the ,}OR 
(green) and Ga; (gold). Asp 350 of Ga, is depicted with narrow lines to 
indicate uncertainty in its conformation due to poor cryo-EM density for 
its side chain. c, Interactions between ICL2 of the 8.AR (orange) and Ga, 
(blue). d, Surface view of the hydrophobic pockets in Ga, (top panel) and 
Gas (bottom panel) that interact with a non-polar amino acid in ICL2 of 
the pOR and 82AR, respectively. 


both complexes occur between ICL2, ICL3 and TM3, TM5 and TM6 
on the receptor and the aN, aN-81 loop and «5-helix on the Ga 
subunit of the G protein (Fig. 4). The most striking differences 
between the 3,AR-G, and 1OR-G; complexes are in the relative 
position of the a5-helix of both G proteins and the corresponding 
shift in the position of TM6 in the receptor. The «5-helix of Gaj is 
rotated ~21° relative to the a5-helix of Gay, leading to a5 A displace- 
ment of the extreme C terminus of the Ga, helix «5 toward TM7 
of the pOR (Fig. 4a). This difference in a5 positioning is associated 
with a smaller outward displacement of the }OR TM6. The C-terminal 
residues of «5 that interact with TM5 and TM6 of the receptor are 
bulkier in G, than in G;, with tyrosine and glutamate in place of 
cysteine and glycine at positions —4 and —3 from the C terminus, 
respectively. Accordingly, substitution of these two amino acids of G, 
into G; would lead to steric clashes with TM3 and the TM7-helix 8 
loop (Extended Data Fig. 7). In the G,-coupled family B calcitonin! 
and GLP-1!° receptors, G-protein coupling is associated with a 
large kink of TM6 at the conserved PXXG motif, which produces an 
even larger outward displacement of TM6 than that observed in the 
82,AR-G, complex. 


Fig. 5 | Comparison of the receptor-G-protein binding interfaces of the 
pOR-G; and 8,AR-G, complexes. a, c, Interactions between ICL3 of pOR 
and Gq; (a) and between the cytosolic ends of TM3, TM5 and TM6 of the 
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Surprisingly, the structure of 1OR-G; shows substantial similarity 
to an active-state structure of the visual pigment rhodopsin (metar- 
hodopsin II) in complex with a modified peptide derived from the 11 
C-terminal residues of the a-subunit of the visual G protein transducin 
(Ga,CT2)?8 (Extended Data Fig. 8). Despite the absence of the rest of 
the heterotrimeric G protein in the metarhodopsin II-Ga,CT2 struc- 
ture, the conformation of TM6 of metarhodopsin II is highly similar 
to that of the pOR, and the location of the Ga,CT2 peptide is almost 
identical to that of the C terminus of G; in complex with OR. This 
finding is consistent with observations showing that substitution of 
the last five amino acids of the a5-helix of Ga is sufficient to change 
G-protein-coupling specificity”. 

In Extended Data Table 2, we list amino acids in the ,,OR that interact 
with the cytoplasmic surface of G;. The ICL2 of pOR primarily forms 
interactions with the aN and a5 helices of Goi, including a key ionic 
interaction between D177**°° of 1OR (G Protein Coupled Receptor 
Data Base (GPCRDB) numbering”) in ICL2 and R32 in the aN-61 
loop of Go, (Fig. 4b). Although D**°° in ICL2 is conserved in all opi- 
oid receptors with available sequences (GPCRDB*}), it is variable in 
most other G;-coupled receptors. Another notable interaction involves 
R179**°” in ICL2 of pOR, which simultaneously coordinates the highly 
conserved D164? in the D?°R?°Y?*! motif and potentially forms an 
additional interaction with D350 (—5 position) in the «5-helix of Ga; 
(Fig. 4b). This arginine residue is essential for ,OR-induced G; signal- 
ling, as the polymorphic variant R179C abolishes signalling in vitro” 
and leads to insensitivity to morphine in patients homozygous for the 
mutation*’. The potential role of this interaction network in G-protein 
coupling is supported by the preponderance of basic residues (arginine 
and lysine) at this position in most G;-coupled receptors, whereas G,- 
coupled receptors contain alternative residues at the equivalent position 
(Extended Data Table 2). 

A further group of contacts occurs between P1723 and V173*4*! of 
pOR and a hydrophobic patch on Go; comprised of residues F336, 1343, 
1344 and T340 on the o5-helix and L194 on the 82-63 loop (Fig. 4b, d). 
In the GDP bound state, these «5-helix residues are buried by the adja- 
cent 32 and 33 loops. Coupling to a receptor involves an upward shift of 
the a5-helix and exposes these residues to form a shallow hydrophobic 
pocket that interacts with pOR V173***! in ICL2 (Fig. 4b, d). In the case 
of G,, a deeper hydrophobic pocket in this region engages the bulky 
aromatic F139°**! in ICL2 of the 82AR (Fig. 4c, d). 

In the OR, ICL3 stabilizes the interface between receptor and G protein 
through two sets of interactions: one set involves multiple contacts with 
a hydrophobic patch on the a5-helix of Gay, while another engages the 
86 strand of Go; through a network of charged residues (Fig. 5a, c). The 
hydrophobic interface formed by ICL3 is similar in both the »OR and 
B2AR; in the 82AR, TMS is helically extended to form a larger hydro- 
phobic interaction around nonpolar residues in the a5-helix of Ga, 
(Fig. 5b). The shorter ICL3 of the ,OR does not form a similar helical 
extension, but it nevertheless fulfills the same role. Residues V262>, 
M264 and L265 fold back to form a hydrophobic patch that interacts 
with hydrophobic residues on the a5-helix of Ga, (Fig. 5a). 


TMS. ¥252558 


i / = P)° | ih 
OR and the a5-helix of G; (c). D350 of Ga; is depicted with narrow lines 


to indicate uncertainty in its position due to poor cryo-EM density for its 
side chain. b, d, The analogous interfaces between 82AR and G, (b, d). 
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The second set of polar contacts involves R263 of wOR and the back- 
bone carbonyl of 1319 on the 86 strand of Ga; (Fig. 5a). Mutations of 
R263 reduce, but do not abolish, G; signalling**, which is consistent 
with the potential importance of stabilizing the 36 strand of Ga, in 
the observed conformation. There is no analogous interaction in the 
B2AR-G, complex (Fig. 5b). This additional recognition interface may 
be necessary for efficient ,,OR-G; coupling owing to the higher affinity 
for GDP of G; relative to Gs. Compared to G,-coupled receptors, addi- 
tional interactions with the 86 strand in G,-coupled receptors may be 
required to disrupt interactions between the Ras domain and GDP for 
efficient nucleotide exchange. 

The cytosolic ends of pjOR TM3, TM5 and TM6 further stabilize the 
nucleotide-free conformation of the «5-helix by interacting with highly 
conserved residues in the distal C terminus of Ga, (Fig. 5c). In particular, 
C351 (—4 position) of Go, is in close proximity to the cytosolic end of 
TM3 of wOR. This cysteine residue has previously been identified as 
the site of pertussis toxin-mediated inhibition of Gi, family proteins 
by enzymatic ADP ribosylation®®. The close apposition of C351 to the 
cytoplasmic surface of 1OR highlights how the addition of a bulky mod- 
ification at this position can completely inhibit receptor coupling and 
nucleotide exchange*® . In addition to this interaction, OR residues 
M255**!, 127893, M281°* and V282°*” form a hydrophobic pocket that 
engages the absolutely conserved Go, residue L353 (—2 position) in the 
05-helix. M255°*! and M281°°* have previously been observed in NUR 
experiments to respond to activation by DAMGO*, suggesting that this 
region undergoes conformational changes prior to G-protein coupling. 
Further stabilization, however, is likely to be provided by a hydrogen 
bond between R277°* and the backbone carbonyl of L353 (Fig. 5b). 
Notably, interactions between the C terminus of the «5-helix and the 
receptor core are entirely different in the 8,AR-G, complex (Fig. 5d). 

Our findings provide structural insights into why »OR does not 
couple to G,, but do not explain the mechanism of G-protein- 
coupling specificity across all GPCRs. It is possible that coupling 
specificity is determined at an intermediate step in the formation of 
a GPCR-G-protein complex, such as the initial interactions between 
the GDP-bound G protein and the agonist-bound receptor. Recent 
single molecule fluorescence studies provide evidence for a transient 
intermediate complex between GDP-bound G, and the B2AR that is 
associated with a smaller outward movement of TM6°”. Previous stud- 
ies suggest that amino acids C-terminal to helix 8 confer coupling spec- 
ificity for Gg in the M3 muscarinic receptor (M3R)**. Given that there 
are no interactions between the C termini of the B2AR or WOR and their 
respective G proteins in the nucleotide-free complexes, we hypothesize 
that engagement of G, with the C terminus of M3R may occur at an 
earlier stage in complex formation. Thus, the nucleotide-free GPCR-G- 
protein complex may be preceded by one or more GDP-bound inter- 
mediates characterized by dynamic low-affinity interactions with the 
receptor. Such ‘initial encounter’ complexes may show larger energetic 
differences among interactions with various G-protein subtypes than 
the nucleotide-free state, and would thereby contribute more critically 
to coupling specificity. The transient nature of such interactions, how- 
ever, poses challenges for structure determination by crystallography 
or cryo-EM. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and the investigators were not blinded to allocation during 
experiments and outcome assessment. 

Purification of jOR. These studies utilized a previously described mouse OR 
construct with cleavable N- and C-terminal domains”. In brief, the receptor 
was expressed in Spodoptera frugiperda Sf9 insect cells using the baculovirus 
method (Expression Systems), extracted from insect cell membranes with 
n-dodecyl-8-p-maltoside (DDM, Anatrace), and purified by nickel-chelating 
sepharose chromatography. The Ni-NTA eluate was loaded onto M1 anti-Flag 
immunoaffinity resin and washed with progressively lower concentrations of the 
antagonist naloxone. The |1OR was then eluted in a buffer consisting of 20mM 
Hepes pH 7.5, 100mM NaCl, 0.1% DDM and 0.01% cholesterol hemisuccinate 
(CHS) supplemented with 50 nM naloxone, Flag peptide and 5mM EDTA. The 
monomeric fraction was purified by size exclusion chromatography on a Superdex 
200 10/300 gel filtration column (GE Healthcare) in 20 mM Hepes pH 7.5, 100 mM 
NaCl, 0.1% DDM, 0.01% CHS, and 11.M DAMGO. A further twofold molar excess 
of DAMGO was added to the preparation and the resulting agonist-bound OR 
preparation was concentrated to ~100)1M. 

Expression and purification of heterotrimeric G;. Heterotrimeric G; was 
expressed and purified as previously described. In brief, Trichuplusia ni Hi5 
insect cells were co-infected with two viruses, one encoding the wild-type human 
Goa, subunit and another encoding the wild-type human 3), subunits with an 
octahistidine tag inserted at the amino terminus of the 8; subunit. Cultures were 
harvested 48 h post infection. Cells were lysed in hypotonic buffer and lipid-modified 
heterotrimeric G; was extracted in a buffer containing 1% sodium cholate. The 
soluble fraction was purified using Ni- NTA chromatography, and the detergent 
was exchanged from cholate to DDM on a column. After elution, the protein was 
dialyzed against a buffer containing 20 mM Hepes pH 7.5, 100mM NaCl, 0.015% 
DDM, 100\1M TCEP, 101M GDP, and concentrated to ~20 mg/ml for further 
complexing with the ,OR. 

Generation of scFv16. Six to eight week old female Balb/c mice were immunized 
with a purified rhodopsin-G; complex*®. Hybridoma cells were prepared using 
splenocytes of immunized mice using standard methods in combination with 
PAI myeloma cells (RRID: CVCL J288). Clones that showed a positive reaction to 
purified rhodopsin(N2C/D282C/M257Y)-Gi; complex in an ELISA assay and by 
immunoprecipitation were further characterized as monoclonal antibodies or Fab 
fragments. Fab-16 was selected from the initial pool of clones because it prevented 
dissociation of the rhodopsin(N2C/D282C/M257Y)-G;; complex by GTP)S, and 
therefore acted as a stabilizing chaperone in the same manner as Nb35 for G,. The 
full sequences of constructs used are listed in Supplementary Fig. 1. All animal 
studies were performed at Roche Innovation Center in Basel according to ethical 
guidelines and internal IRB approval. All cell lines were obtained from the manu- 
facturer and tested for contamination. 

A C-terminal hexahistidine-tagged single chain construct of Fab16 (scFv16) 
was cloned into a modified pVL1392 vector containing a GP67 secretion signal 
immediately prior to the amino terminus of the scFv, expressed in secreted form 
from Trichuplusia ni Hi5 insect cells using the baculovirus method, and purified 
by Ni-NTA chromatography. Supernatant from baculovirus-infected cells was pH 
balanced by addition of Tris pH 8.0. Chelating agents were quenched by addition of 
1mM nickel chloride and 5 mM calcium chloride and incubation with stirring for 
lhat 25°C. Resulting precipitates were removed by centrifugation and the super- 
natant was loaded onto Ni-NTA resin. The column was washed with 20 mM Hepes 
pH 7.5, 500mM NaCl, and 10 mM imidazole followed by a low salt wash comprised 
of the same buffer substituted with 100 mM NaCl. Following elution with the same 
buffer supplemented with 250 mM imidazole, the C-terminal hexahistidine tag 
was cleaved by incubation with human rhinovirus 3C protease, and the protein 
was dialyzed into a buffer consisting of 20mM Hepes pH 7.5 and 100mM NaCl. 
Cleaved scFv16 was further purified by reloading over Ni-NTA resin. The flow- 
through was collected and purified over gel filtration chromatography using a 
Superdex 200 16/60 column. Monomeric fractions were pooled, concentrated, 
and flash frozen in liquid nitrogen until further use. 

Formation and purification of the 1j»OR-G,-scFv16 complex. Purified DAMGO- 
bound OR was mixed with a 1.2 molar excess of G; heterotrimer. The coupling 
reaction was allowed to proceed at 24°C for 1h and was followed by addition of 
apyrase to catalyze hydrolysis of unbound GDP, which destabilizes the nucleotide- 
free complex*®. After one more hour at 25°C, a fourfold volume of 20 mM 
Hepes pH 7.5, 100mM NaCl, 1% lauryl maltose neopentyl glycol (L-MNG), 
0.1% CHS was added to the complexing reaction to initiate detergent exchange. 
After 1h incubation at 25°C to allow micelle exchange, 1 mM MnCl, and lambda 
phosphatase (New England Biolabs) were added to dephosphorylate the prepara- 
tion. This reaction was further incubated at 4°C for 2h. To remove excess G protein 
and residual DDM, the complexing mixture was purified by M1 anti-Flag affin- 
ity chromatography. Bound complex was first washed in a buffer containing 1% 
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L-MNG, followed by washes in gradually decreasing L-MNG concentrations. The 
complex was then eluted in 20mM Hepes pH 7.5, 100 mM NaCl, 0.01% L-MNG, 
0.001% CHS, 300nM DAMGO, 5mM EDTA and Flag peptide. The eluted complex 
was supplemented with 100|1M TCEP to provide a reducing environment. The 
tobacco etch virus (TEV) protease and human rhinovirus 3C protease were added 
to cleave the flexible |tOR N and C termini. Finally, a 1.2 molar excess of scFv16 
was added to the preparation. Once cleavage of the termini was confirmed by 
SDS-PAGE, the ,,OR-G;-scFv16 complex was purified by size exclusion chroma- 
tography on a Superdex 200 10/300 column in 20 mM Hepes pH 7.5, 100mM NaCl, 
300nM DAMGO, 0.00075% L-MNG and 000025% glyco-diosgenin (GDN) with 
0.0001% CHS total. Peak fractions were concentrated to ~7 mg/ml for electron 
microscopy studies. 
Cryo-EM and 3D reconstructions of |s,OR-G;-scFv16 complex. Three microlitres 
of purified |,OR-G;-scFv16 complex was applied to glow-discharged 200-mesh 
grids (Quantifoil R1.2/1.3) and subsequently vitrified using a Vitrobot Mark IV 
(Thermo Fischer Scientific). Cryo-EM imaging was performed on a Titan Krios 
operated at 300kV at a nominal magnification of 130,000 using a Gatan K2 
Summit direct electron camera in counted mode, corresponding to a pixel size of 
1.04A. A total of 2642 image stacks were obtained with a defocus range of —0.8 to 
—2.6um. Each stack movie was recorded for a total of 8s with 0.1s per frame. The 
dose rate was 5e~/A2/s, resulting in an accumulated dose of 40 electrons per A 
Dose-fractionated image stacks were subjected to beam-induced motion cor- 
rection using MotionCor2*!. A sum of all frames, filtered according to exposure 
dose, in each image stack was used for further processing. CTF parameters for each 
micrograph were determined by Gctf v1.06”. Particle selection, two-dimensional 
and three-dimensional classification, and 3D reconstruction were performed using 
RELION2.1%°, apart from the last round of local refinement and reconstruction 
that was performed with Frealign*’. Semi-automated selected 893,426 particle 
projections were subjected to reference-free 2D classification and averaging using 
a binned dataset with a pixel size of 2.08 A. Particles (379,373) with well-defined 
averages were subjected to further processing. An ab initio map generated by 
VIPER* was used as initial reference model for maximum-likelihood-based 3D 
classification, which, however, did not produce classes with notable differences. 
Thus, all 379,373 particle projections were subjected to 3D refinement, producing a 
map at 4.3 A resolution. The dataset was further reduced by removing particle pro- 
jections from micrographs with resolution lower than 4.5 A, resulting in a dataset 
of 359,406 particles that were subjected to refinement and reconstruction after sub- 
tracting densities for the mobile Ga «helical domain and the detergent micelle!!. 
Particle projection assignments from RELION were imported into Frealign*® for 
a final round of local refinement and reconstruction. To prevent overfitting, the 
resolution limit for each alignment iteration never exceeded the 0.9 value of the 
Frealign calculated Fourier shell correlation (FSC). The map was further improved 
after additionally subtracting densities corresponding to the ScFv from the raw 
particle projections!!. The indicated resolution, using Phenix ‘gold standard’ FSC”, 
of the final reconstruction is 3.5 A and 3.6 A at FSC 0.143 for the ScFv-subtracted 
map and the ScFv-including map, respectively. Local resolution was determined 
using the Bsoft package“* with unfiltered half maps as input. 
Model building and refinement. The building of a full atomic model for the 
jtOR-G; complex was aided by the quality and resolution of our map, as well as 
the existence of high-resolution crystal structures of each of the components that 
make up the complex. A composite model was formed by rigid body fitting of the 
active-state |tOR (PDB code 5C1M)"” with nanobody removed, as well as the Ras 
domain and 8+ subunits of GDP-bound G; (PDB 1GP2)”*. The «5-helix of Go; was 
removed and manually fitted to the density, and the final eight residues that were 
missing from the extreme C terminus of the 1GP2 structure were manually built 
in Coot. This starting model was then subjected to iterative rounds of automated 
refinement in Rosetta®® and Phenix real space refine*”, and manual building in 
Coot”. In the regions of the model for which side-chain density was too weak to 
unambiguously assign a conformation, we stubbed residues to their Cg position, 
while preserving sequence information (Supplementary Fig. 2, 3). The final model 
was visually inspected for general fit to the map, and geometry was further eval- 
uated using Molprobity”! as part of the Phenix suite of software. Initial restraints 
for DAMGO were generated using the PRODRG server™. To further refine the 
pose of DAMGO, we chose a pose from molecular dynamics simulation consist- 
ent with our map and then performed a refinement using Phenix. This involved 
manually editing the residue and atom names from a CHARMM parameter file to 
match the three-letter codes and atom names from the RCSB. In particular, DAL 
for p-alanine, MEA for N-methyl phenylalanine, and ETA for Gly-ol C terminus. 
Additional, custom, restraints were generated to maintain planarity of the final 
peptide bond between MEA and ETA as a supplement to the natural library of 
Phenix amino acid restraints. Model overfitting was evaluated through its refine- 
ment against one cryo-EM half map after randomly displacing all atoms by 0.2 A. 
FSC curves were calculated between the resulting model and the half map used for 
refinement ( red curve, Extended Data Fig. 3b, c), as well as between the resulting 
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model and the other half map for cross validation (green curve, Extended Data 
Fig. 3b, c), and also against the full map ( black curve, Extended Data Fig. 3b, c). 
The final refinement statistics for both models are provided in Extended Data 
Table 1. 

System setup for molecular dynamics simulations. Molecular dynamics 
simulations were initiated from an earlier refinement of the structure reported 
in this study after removing the G protein and ScFv fragment. Prior to beginning 
simulations, Schrédinger Glide*? was used to relax DAMGO to an energetically 
favourable conformation. The initial DAMGO pose is depicted in Extended Data 
Fig. 3. We performed five independent simulations, for each of which initial atom 
velocities were assigned randomly and independently. Prime (Schrédinger) was 
used to model missing side chains, and neutral acetyl and methylamide groups 
were added to cap protein termini. Titratable residues remained in their dominant 
protonation state at pH 7, as determined using PropKa, except for D?°° and D>”, 
which were protonated. Our simulations incorporated the waters from the 5C1M 
crystal structure. 

The prepared protein structures were aligned to the ‘orientation of proteins 

in membranes’ (OPM) structure for PDB entry 5C1M™. The aligned structures 
were then inserted into a pre-equilibrated palmitoyl-oleoyl-phosphatidylcholine 
(POPC) bilayer using Dabble, a simulation preparation software°’. Sodium and 
chloride ions were added to neutralize each system at a concentration of approxi- 
mately 150mM. Bilayer dimensions were chosen to maintain at least a 30 A buffer 
between protein images in the x-y plane and a 20 A buffer between protein images 
in the z direction. Final system dimensions were approximately 80 x 75 x 90 A®. 
Simulation times for each replicate were approximately 1 1s. 
Molecular dynamics simulation protocols. We used the CHARMM36m force 
field for proteins, lipids and ions and the TIP3P model for waters**°. Parameters 
for the non-canonical residues in DAMGO were determined by analogy to 
N-methyl glycine for assigning N-methyl parameters to N-methyl phenylalanine 
(residue 4) and by analogy to serine to assign parameters to the Gly-ol capping 
group (residue 5). CMAP terms for p-alanine were inverted from those for 
L-Alanine to account for the inverted chirality of the residue. 

We performed the simulations using the Compute Unified Device Architecture 
(CUDA) version of Particle-Mesh Ewald Molecular Dynamics (PMEMD) in 
AMBER on one or two graphical processing units (GPUs)*". Simulations were 
performed using the AMBER16® software. Three rounds of minimization were 
performed, each consisting of 500 iterations of steepest descent minimization, fol- 
lowed by 500 iterations of conjugate gradient descent minimization, with harmonic 
restraints of 10.0, 5.0 and 1.0kcal mol~! A~? placed on the protein and lipids. 
Systems were heated from 0K to 100K in the NVT ensemble over 12.5 ps and then 
from 100K to 310K in the NPT ensemble over 125 ps, using 10.0kcal mol! A~? 
harmonic restraints applied to lipid and protein heavy atoms. Systems were then 
equilibrated at 310 K in the NPT ensemble at 1 bar, with harmonic restraints on all 
protein heavy atoms tapered off by 1.0kcal mol”! A~ starting at 5.0kcal mol! A~? 
in a stepwise fashion every 2ns for 10 ns and then by 0.1 kcal mol~! A~? ina step- 
wise fashion every 2 ns for 20 ns. Production simulations were performed in the 
NPT ensemble at 310K and 1 bar, using a Langevin thermostat for temperature 
coupling and a Monte Carlo barostat for pressure coupling. These simulations 
used a 4-fs time step with hydrogen mass repartitioning®’. Bond lengths to hydro- 
gen atoms were constrained using SHAKE. Simulations used periodic boundary 
conditions. Non-bonded interactions were cut off at 9.0 A, and long-range elec- 
trostatic interactions were computed using Particle Mesh Ewald (PME) with an 
Ewald coefficient of approximately 0.31 A and an interpolation order of four. The 
FFT grid size was chosen such that the width of a grid cell was approximately 1 A. 

During production simulations, all residues within 5 A of the G-protein inter- 

face were restrained to the initial structure using 5.0kcal mol“! A~* harmonic 
restraints applied to non-hydrogen atoms. Using such restraints reduces the overall 
system size, enabling more simulation, while ensuring that the receptor maintains 
an active conformation throughout the simulation. 
Analysis protocols for molecular dynamics simulation. Trajectory snapshots 
were saved every 200 ps during production simulations. The AmberTools17 
CPPTRAJ package was used to reimage and center trajectories“. Simulations 
were visualized and analysed using Visual Molecular Dynamics (VMD)®. In 
two simulations, DAMGO was trapped in an unstable binding pose, wherein the 
water-mediated interaction between the DAMGO tyrosine residue and H297 failed 
to form during equilibration, and instead a direct hydrogen bond between these 
residues was formed. Our analysis is based on the other three simulations, in which 
DAMGO’ pose was consistent with the cryo-EM density. Water occupancy maps 
were generated using AmberTools17 GIST. Frames from every 1 ns of simula- 
tion, excluding the first 400 ns, aligned to the initial structure, were used as input. 
The grid size was set to 0.25 A. The resulting map was smoothed using a Gaussian 
filter with a standard deviation of two grid cells. 


Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. All data generated or analysed during this study are included 
in this published article and its Supplementary Information. Sequences of con- 
structs used in this study are listed in Supplementary Fig. 1. The cryo-EM density 
maps for the ,OR-G; complex with, and without, scFv16 have been deposited in 
the Electron Microscopy Data Bank (EMDB) under accession codes EMD-7868 
and EMD-7869, respectively. The coordinates for the models of jOR-G; with, 
and without, scFv-16 have been deposited in the Protein Data Bank (PDB) under 
accession numbers 6DDE and 6DDF respectively. 


39. Maeda, S. et al. Crystallization scale preparation of a stable GPCR signaling 
complex between constitutively active rhodopsin and G-protein. PLoS One 9, 
e987 14 (2014). 

40. Westfield, G. H. et al. Structural flexibility of the Gas a-helical domain in the 

B2-adrenoceptor Gs complex. Proc. Nat! Acad. Sci. USA 108, 16086-16091 

(2011). 

Al. Zheng, S. Q., Palovcak, E., Armache, J.-P., Verba, K. A., Cheng, Y. & Agard, D. A. 

MotionCor2: anisotropic correction of beam-induced motion for improved 

cryo-electron microscopy. Nat. Methods 14, 331-332 (2016). 

42. Zhang, K. Gctf: Real-time CTF determination and correction. J. Struct. Biol. 193, 

1-12 (2016). 

43. Scheres, S. H.W. RELION: implementation of a Bayesian approach to cryo-EM 

structure determination J. Struct. Biol. 180, 519-530 (2012). 

44. Grigorieff, N. FREALIGN: high-resolution refinement of single particle structures. 

J. Struct. Biol. 157, 117-125 (2007). 

45. Penczek, P.A., Grassucci, R. A. & Frank, J. The ribosome at improved 

resolution: new techniques for merging and orientation refinement in 3D 

cryo-electron microscopy of biological particles. U/tramicroscopy 53, 251-270 

(1994). 

46. Grigorieff, N. Frealign: an exploratory tool for single-particle Cryo-EM. Methods 

Enzymol. 579, 191-226 (2016). 

47. Adams, P. D. et al. The Phenix software for automated determination of 

macromolecular structures. Methods 55, 94-106 (2011). 

48. Heymann, J. B. & Belnap, D. M. Bsoft: image processing and molecular 

modeling for electron microscopy. J. Struct. Biol. 157, 3-18 (2007). 

49. Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. 

Acta Crystallogr. D 60, 2126-2132 (2004). 

50. Wang, R. Y.-R. et al. Automated structure refinement of macromolecular 
assemblies from cryo-EM maps using Rosetta. eLife 5,352 (2016). 

51. Williams, C. J. et al. MolProbity: More and better reference data for improved 
all-atom structure validation. Protein Sci. 27, 293-315 (2018). 

52. Schuttelkopf, A. W. & van Aalten, D. M. F. PRODRG: a tool for high-throughput 
crystallography of protein-ligand complexes. Acta Crystallogr. D 60, 1355-1363 
(2004). 

53. Friesner, R.A. et al. Glide: a new approach for rapid, accurate docking and 
scoring. 1. Method and assessment of docking accuracy. J. Med. Chem. 47, 
1739-1749 (2004). 

54. Lomize, M.A., Lomize, A. L., Pogozheva, |. D. & Mosberg, H. |. OPM: 
orientations of proteins in membranes database. Bioinformatics 22, 623-625 
(2006). 

55. Betz, R. Dabble (v.2.6.3). https://doi.org/10.5281/zenodo.836914 (2017). 

56. Best, R. B., Mittal, J., Feig, M. & Mackerell, A. D. Inclusion of many-body effects in 
the additive CHARMM protein CMAP potential results in enhanced cooperativity 
of a-helix and 6-hairpin formation. Biophys. J. 103, 1045-1051 (2012). 

57. Best, R. B. et al. Optimization of the additive CHARMM all-atom protein force 
field targeting improved sampling of the backbone y, t and side-chain x and 
x2 dihedral angles. J. Chem. Theory Comput. 8, 3257-3273 (2012). 

58. Huang, J. & MacKerell, A.D. CHARMM36 all-atom additive protein force field: 
validation based on comparison to NMR data. J. Comput. Chem. 34, 2135-2145 
(2013). 

59. Klauda, J. B. et al. Update of the CHARMM all-atom additive force field for lipids: 
validation on six lipid types. J. Phys. Chem. B 114, 7830-7843 (2010). 

60. Huang, J. et al. CHARMM36m: an improved force field for folded and 
intrinsically disordered proteins. Nat. Methods 14, 71-73 (2016). 

61. Salomon-Ferrer, R., Gotz, A. W., Poole, D., Le Grand, S. & Walker, R. C. Routine 
microsecond molecular dynamics simulations with AMBER on GPUs. 2. Explicit 
solvent particle mesh Ewald. J. Chem. Theory Comput. 9, 3878-3888 (2013). 

62. Case, D. A. et al. Amber (v.16). http://ambermd.org (2018). 

63. Hopkins, C. W., Le Grand, S., Walker, R. C. & Roitberg, A. E. Long-time-step 
molecular dynamics through hydrogen mass repartitioning. J. Chem. Theory 
Comput. 11, 1864-1874 (2015). 

64. Roe, D. R. & Cheatham, T. E. Ill. PTRAJ and CPPTRAJ: software for processing 
and analysis of molecular dynamics trajectory data. J. Chem. Theory Comput. 9, 
3084-3095 (2013). 

65. Humphrey, W., Dalke, A. & Schulten, K. VMD: visual molecular dynamics. J. Mol. 
Graph. 14, 33-38 (1996). 

66. Nguyen, C. N., Young, T. K. & Gilson, M. K. Grid inhomogeneous solvation theory: 
hydration structure and thermodynamics of the miniature receptor cucurbit[7] 
uril. J. Chem. Phys. 137, 044101 (2012). 

67. Nguyen, C., Gilson, M. K. & Young, T. Structure and thermodynamics of 
molecular hydration via grid inhomogeneous solvation theory. Preprint at 
https://arXiv.org/abs/1108.4876 (2011). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


Nucleotide Free Gs 


GB 


Extended Data Fig. 1 | Binding characteristics of scFv. a,b, scFv 16 does _ subunit), there is a ~7° rotation of G+ relative to the Ga, switch II domain 


not perturb the interfaces between Ga and Gf at its binding epitope (a) when compared to the GDP-bound form. d, This rotated conformation is 
or the switch II region located ~40 A away (b). Our structure is coloured similar to that observed in nucleotide-free G, coupled to the B2AR (PDB 
by chain, whereas the structure of GDP-bound Gi; heterotrimer (PDB code 3SN6). 


code 1GP2) is coloured grey. c, In the nucleotide-free state (coloured by 
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Extended Data Fig. 2 | Cryo-EM data processing. a, Representative corresponding density of the unmasked reconstruction. 3D density maps 
cryo-EM image of the 1OR-G; complex. Scale bar, 20 nm. b, Representative coloured according to local resolution. d, ‘Gold standard’ FSC curves from 
2D averages showing distinct secondary structure features from different Phenix indicate overall nominal resolutions of 3.5 A and 3.6 A using the 
views of the complex. c, Flow chart of cryo-EM data processing. The FSC = 0.143 criterion for the scFv-subtracted map (green curve) and 
unmasked map in the middle of the chart has been coloured by subunit. scFy-retained maps (purple curve), respectively. 


The inset shows the fit of the crystal structure of the a-helical domain in the 
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Extended Data Fig. 3 | Cryo-EM map versus refined structure. cryo-EM map (full dataset, black) of the outcome of model refinement 
a, Cryo-EM density map (scFv subtracted) and model are shown for all with a half map versus the same map (red), and of the outcome of model 
seven transmembrane a-helices of the j;OR, DAMGO, and Ga helices refinement with a half map versus the other half map (green). The results 
a5 and aN. b, ¢, Cross-validation of model to cryo-EM density map. The of the scFv-retained model versus map (b) and of scFv subtracted model 
model was refined against one half map after displacement of atoms by versus map (c) are shown. 


0.2 A, and FSC curves were calculated between this model and the final 
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Extended Data Fig. 4 | Selected cryo-EM densities of 1,OR-G; complex. and cytoplasmic ends of the }tOR transmembrane helices (d). These figures 
a-d, Cryo-EM density (displayed as mesh) surrounding residues involved | accompany the models shown in Figs. le, 4b, 5a and 5c, respectively. 
in DAMGO binding (a), 1OR-Go,; interaction around ICL2 (b), ICL3 (c), 
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Extended Data Fig. 5 | Stability of DAMGO in molecular dynamics C-terminal residues over the course of simulation. Cryo-EM density for 
simulations. a, Over the course of molecular dynamics simulations, the DAMGO is shown as mesh. b, Root mean standard deviations (RMSDs) 
positions of the first four residues of DAMGO do not significantly change, from the modelled pose of DAMGO to the pose during molecular 
while the fifth residue (Gly-ol) shows significant variability in position. dynamics simulations. The RMSD calculations include heavy atoms on the 
Frames from the first and last 100 ns are shown with an intermediate to peptide backbone. Data from three independent simulations are plotted. 
highlight both the relative stability of the first four amino acids and the The RMSDs for residues 1 to 4 (black) and the C-terminal Gly-ol (blue) are 
flexibility of the fifth. Arrows show the extent of motion in the N- and plotted separately to highlight their stability and mobility, respectively. 
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Regions of high water density in the DAMGO binding pocket during MD simulation 
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Extended Data Fig. 6 | Water occupancy in orthosteric binding site. 
Left, water occupancy in molecular dynamics simulations of DAMGO- 
bound OR overlaid with a representative conformation from molecular 
dynamics simulations. Occupancy relative to bulk solvent is the ratio of the 
rate at which water is observed in a given volume to the rate at which water 
is expected to be observed in an equivalent volume in the bulk solvent. 


8 i452» 


For example, blue regions (occupancy ratio = 2) are occupied by 

water twice as often as an equivalent region in the bulk solvent. Right, 
crystallographic waters in the BU72-bound OR binding pocket (PDB 
code 5C1M). Waters are shown as black spheres, BU72 is shown as yellow 
sticks, and hydrogen bonds are shown as dashed lines. 
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Extended Data Fig. 7 | Comparison of the C termini of Ga, and Ga,. 

The C terminus of Ga, is bulkier than that of Ga; owing to substitution of 
small amino acids C (—4 position) and G (—3 position) in Ga; to Y and E, 
respectively, in Gas. This leads to steric clashes with TM3 and TM7 of the 
WOR. Top, ribbon view of LOR (green) with wild-type Ga, (gold, left) and 


a Gays model (right) created by substituting C and G for Y and E based on 
the 8,AR-G, crystal structure. Substituted positions are coloured in light 
purple. The —4 to —2 positions have their side chains shown as spheres, 
and the rest are shown as a ribbon. Bottom, space-filling view of the }OR 
showing the steric clashes that result from these substitutions. 
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Extended Data Fig. 8 | Comparison of Ga; C-terminal peptide binding 11-residue Gtransducin (Gy) C-terminal peptide (dark purple) (PDB code 


modes. a-c, Side (top) and cytoplasmic (bottom) views of the »OR 3PQR) (c). The pOR-G; complex aligns best with the metarhodopsin 
(green) with the last 11 residues of Ga; (gold) alone (a), compared to the II-G, complex in terms of both TM6 displacement and position of the 
82AR(orange) with the last 11 residues of Ga, (light purple) (PDB code a5 peptide. 

3SN6) (b), or compared to metarhodopsin II (pink) in complex with an 
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Extended Data Table 1 | Cryo-EM data collection, refinement and validation statistics 


Data collection and processing 
Magnification 
Voltage (kV) 
Electron exposure (e—/A2) 
Defocus range (um) 
Pixel size (A) 
Symmetry imposed 
Initial particle images (no.) 
Final particle images (no.) 
Map resolution (A) 

FSC threshold 
Map resolution range (A) 


Refinement 
Initial model used (PDB code) 


Model resolution (A) 
Model resolution range (A) 


Map sharpening B factor (A?) 

Model composition 
Non-hydrogen atoms 
Protein residues 


Ligands 
B factors (A2) 
Protein 
Ligand 
R.m.s. deviations 
Bond lengths (A) 
Bond angles (°) 
Validation 
MolProbity score 
Clashscore 
Poor rotamers (%) 
Ramachandran plot 
Favored (%) 
Allowed (%) 
Disallowed (%) 


HLOR-G; Complex 
(EMDB-7869) 
(PDB 6DDF) 


48,076 
300 


40 
=0,8°"'=2.6 
1.04 

C1 
893,426 
359,406 
3.5A 
(0.143) 
3.3-4.5 


5C1M 
1GP2 
3.5 


3.3-4.5 
Pre -90, post -60 


6986 

886 residues (6949 
atoms) 

1 (37 atoms) 


33.23 
31.27 


0.007 
1.303 


1.93 
7.77 
0.72 


91.54 
8.35 
0.11% 


LWOR-Gj-scFv16 
Complex 
(EMDB-7868) 
(PDB 6DDE) 


48,076 
300 


40 
-0.8~ -2.6 
1.04 

C1 
893,426 
359,406 
3.6A 
(0.143) 
3.3-4.5 


5C1M 
1GP2 
3.6 


3.3-4.5 
Pre -90, post -60 


8731 

1119 residues 
(8694 atoms) 
1 (37 atoms) 


69.81 
100.66 


0.010 
1.328 


1.83 
6.97 
0.46 


93.02 
6.89 
0.09% 
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Extended Data Table 2 | Sequence alignment of residues that form the interaction interface between OR and G; 


Coupling Branch 


1103 Cem [Rizo [T1800 ik271 |R277 [racy 


2.39 3.54 34.50 34.51 34.55 34.57 4.38 5.61 6.26 6.32 6.33 


Mu-Opioid Receptor Residue 


[Human] 5-HT1A receptor Gi 
[Human] 5-HT1B receptor Gi 
[Human] M2 receptor Gi 
[Human] M4 receptor Gi 
[Human] a2A-adrenoceptor Gi 
[Human] FPR1 Gi | 7 eee N | OR 
[Human] FPR2/ALX Gi 
[Human] GAL1 receptor Gi 
[Human] GAL3 receptor Gi 


[Human] 6 Opioid Receptor Gi 
[Human] « Opioid Receptor Gi 
[Human] jw Opioid Receptor Gi 


Q@RRRRRRRRRRRRRRRReEQRRAR 


[Human] NOP receptor Gi 

[Human] SST1 receptor Gi 

[Human] SST2 receptor Gi 

[Human] SST3 receptor Gi 

[Human] SST4 receptor Gi 

[Human] SST5 receptor Gi 

[Human] CCR1 Gi 

[Human] CCR4 Gi 

[Human] CXCR4 Gi 

[Human] A1 receptor Gi 

[Human] f1-adrenoceptor Gs Qa 

[Human] [2-adrenoceptor Gs Q 

[Human] MC1 receptor Gs a | - Ta G 
[Human] MC2 receptor Gs a - Ka@ilc 
[Human] MC4 receptor Gs a iw - Eaiic 
[Human] A2A receptor Gs a | i Ae 
[Human] H2 receptor Gs a P 1 A Ea 
[Human] TA1 receptor Gs a P Eee ee | BE 
[Human] RXFP1 Gs 5 > iz MBM: et 
[Human] RXFP2 Gs 5 > IZ M Baa ve 
[Human] V2 receptor Gs B Pp Mic Eg #64v CE 
[Human] 5-HT2A receptor Gq a P | EG fs | t|s | k Im 
[Human] 5-HT2B receptor Gq a P a CO | 1s | tT [Tt [eR Ie 
[Human] M1 receptor Gq p t A i” + $ Hama. 
[Human] M3 receptor Gq p ct A i” + $ Hana. 
[Human] M5 receptor Gq a hn am | ia 
[Human] a1A-adrenoceptor Gq Qa P os. Ei | Pia kK | kK Oe 
[Human] GAL2 receptor Gq oy Pp t ETRE. Fev 
[Human] OX1 receptor Gq B P i a | T| BM of Kk {Tt 
[Human] OX2 receptor Gq B mms Po 1 Bm alk [Tl 
[Human] NK1 receptor Gq B 7 i = = a = | s Mime Oo | k ae 


Receptors from different branches of the GPCR family with different coupling specificities were selected for analysis. Sequences and alignment were performed using GPCRDB (http://www.gpcrdb.org) 
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G-protein-coupled receptors comprise the largest family of mammalian transmembrane receptors. They mediate 
numerous cellular pathways by coupling with downstream signalling transducers, including the hetrotrimeric G proteins 
G, (stimulatory) and G; (inhibitory) and several arrestin proteins. The structural mechanisms that define how G-protein- 
coupled receptors selectively couple to a specific type of G protein or arrestin remain unknown. Here, using cryo-electron 
microscopy, we show that the major interactions between activated rhodopsin and G; are mediated by the C-terminal 
helix of the Gj a-subunit, which is wedged into the cytoplasmic cavity of the transmembrane helix bundle and directly 
contacts the amino terminus of helix 8 of rhodopsin. Structural comparisons of inactive, Gj-bound and arrestin-bound 
forms of rhodopsin with inactive and G,-bound forms of the 82.-adrenergic receptor provide a foundation to understand 
the unique structural signatures that are associated with the recognition of G,, Gj and arrestin by activated G-protein- 


coupled receptors. 


The selective coupling of cell-surface receptors with specific intracellu- 
lar effector proteins is a fundamental step in transmembrane signalling. 
G-protein-coupled receptors (GPCRs) constitute the largest protein 
family of transmembrane receptors with more than 800 members in 
humans’. These receptors signal primarily through intracellular G 
proteins and arrestin proteins. Compared to the vast diversity of the 
GPCR family members and their physiological functions, the number 
of intracellular signalling transducers are much limited, with only four 
major types of G protein” (Fig. 1a) and two major types of arrestin’. 
These distinct signalling transducers regulate the generation ofa vari- 
ety of secondary messengers and activate various downstream kinases, 
which lead to diverse cellular signalling pathways and physiological 
consequences. Thus, the selective coupling of a specific transducer 
protein by a GPCR is crucial for cellular signalling and responses to 
extracellular stimuli. 

Our understanding of GPCR signalling has been greatly enhanced 
by the remarkable progress in GPCR structural biology, including the 
determination of over 170 structures from 40 unique GPCR members 
in active or inactive states and in complex with G, or arrestin’. In the 
inactive states, the cytoplasmic end of the seven-transmembrane-helix 
domain (TMD) is closed*-’, thus preventing GPCRs from effective cou- 
pling with cellular signalling transducers. Agonist binding induces con- 
formational changes in the TMD, including an outward movement at 
the cytoplasmic end of transmembrane helix 6 (TM6)*"’, which opens 
a cavity in the TMD bundle for interaction with one or more specific 
transducer proteins. This mechanism is particularly highlighted by the 
notable outward movement (14-18 A) at the cytoplasmic side of TM6 in 
the G,-bound B,-adrenergic receptor (8AR) structure!* and the recent 
cryo-electron microscopy (cryo-EM) structures of G,-bound class B 


GPCRs!*'®, Combination of the structural observations with sequence 
analyses has revealed a barcode system in G proteins for GPCR-G- 
protein binding selectivity'’. However, less is known about the 
mechanisms determining selectivity on the receptor, in particular 
the subset of G;-coupled receptors, despite the fact that they constitute the 
largest fraction of GPCR proteins’. 

Rhodopsin is a prototypical GPCR that plays a crucial role in light 
perception and has served as a model system for studying GPCR sig- 
nalling'**°. In the pre-illuminated state, rhodopsin adopts an inactive 
conformation stabilized by the inverse agonist, 11-cis-retinal. Light 
induces isomerization of the retinal ligand to the all-trans-retinal (ATR) 
configuration, which activates the receptor. The activated rhodopsin 
is coupled to the G protein transducin (G;) (Fig. 1a), which is a G; 
homologue”!’, to initiate the light-sensing signalling pathways. To 
understand the structural basis for the selective coupling of G; by a 
GPCR, we determined the structure of the rhodopsin-G; complex at a 
near atomic resolution using cryo-EM. 


Cryo-EM of the rhodopsin-G; complex 

To prepare a stable rhodopsin-—G; complex for structural studies, we 
took advantage of a constitutively active form of rhodopsin (termed 
4M, containing mutations N2N*"™C, E113°8Q, M257°°Y, N2822C3¢; 
superscripts denote Ballesteros—Weinstein numbering), that was pre- 
viously used to determine the structure of the rhodopsin-arrestin 
complex’?* In cell-based assays, the constitutively active rhodopsin 
activates the G; coupling pathway more strongly than the wild-type 
receptor (Fig. 1b). Importantly, wild-type and constitutively active rho- 
dopsin do not activate the G, pathway regardless of the presence of ATR 
(Extended Data Fig. 1c). Because previous studies indicated that the 
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Fig. 1 | Assembly of the rhodopsin-G; protein complex. a, Schematic 
illustration of G-protein-mediated GPCR signalling by the four types 

of G protein. Light-activated rhodopsin is specifically coupled to the 

Gijoyt Subtype. b, G; signalling activated by the 4M mutant and wild-type 
(WT) rhodopsin as measured by a serum response element (SRE)-driven 
luciferase reporter assay (n = 3). Data are mean + s.d. c, Experimental 
DEER distance distributions of rhodopsin in the presence (red line) 

or absence (black line) of Fab. Y74R1-Q225R1, Y74R1-R252R1 and 
Y74R1-M308R1 denote R1 nitroxide pairs. d, Iso-surface rendering of the 
cryo-EM density map for the rhodopsin-G;-Fab complex. 


active rhodopsin-G; complex is more stable than the active rhodopsin- 
G, complex”, we focused on obtaining a stable complex of activated 
rhodopsin bound to a dominant-negative mutant form of G; (Extended 
Data Fig. 1a, b), which was used to promote the nucleotide-free form 
of G,°°. To further stabilize the rhodopsin-G, complex, we screened 
a panel of antibodies against the dominant-negative G; from a phage 
display Fab library by negative-stain electron microscopy to identify 
one Fab fragment, termed Fab_G50, which stabilized the rhodopsin- 
G; complex. The conformational state of rhodopsin in complex with 
G; and Fab was surveyed by high-resolution distance mapping with 
site-directed spin labelling and double electron—electron resonance 
(DEER) spectroscopy in detergent micelles (Fig. 1c and Extended Data 
Fig. 1d-f). Almost identical distances were measured when rhodopsin 
was bound to G in the presence or absence of Fab_G50, suggesting that 
the Fab fragment does not affect the conformation of rhodopsin. In 
addition, the distance distributions of TM6 and TM7 relative to TM2 
are nearly identical between G;-bound and G,-bound rhodopsin?” 
(Extended Data Fig. le). 

The cryo-EM structure of the rhodopsin-G;-Fab complex was deter- 
mined at a nominal global resolution of 4.5 A (Fig. 1d and Extended 
Data Fig. 2a-f), which reveals a well-defined density map for the 
rhodopsin seven-transmembrane bundle, the Ga; Ras-like domain, 
and the G8 and Gy subunits (Fig. 2a, b and Extended Data Fig. 3a-g). 
The position of the a-helical domain (AHD) of Ga; is well defined 
owing to the direct stabilization of this domain by the Fab fragment, 
which simultaneously interacts with the Ga; AHD and the G8 subunit 
(Fig. 1d). The binding site of the Fab fragment is far away from the 
rhodopsin-G; interface, consistent with the fact that Fab binding did 
not affect the conformation of rhodopsin. 


The rhodopsin-G; interface 

The structure of the rhodopsin—G; complex (Fig. 2a, b and Extended 
Data Table 1) reveals that interactions between rhodopsin and G; are 
exclusively mediated through the Ga; subunit, and that there is no 
contact between rhodopsin and the GB subunits. The most important 
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Fig. 2 | The cryo-EM structure of the rhodopsin-G; complex. 

a, b, Orthogonal views of the cryo-EM density map of the rhodopsin-G; 
complex, coloured by subunits. c, d, Ribbon diagram representation of the 
structure of the rhodopsin-G; complex. 


interface between rhodopsin and Ga, is formed by the last 11 residues 
of the C-terminal 05-helix of Ga, (Fig. 2c, d), consistent with its known 
crucial role in receptor binding””~*>. In the complex, the C-terminal 11 
residues (residues 344-354) of Ga, adopt a straight amphipathic helix 
(a5-helix) to residue C351, followed by a three-residue loop (residue 
352-GLF-354) (Fig. 3a, b). 

The negatively charged dipole of the C terminus of the «5-helix 
forms an electrostatic interaction with the positively charged dipole 
of the N terminus of helix 8 of rhodopsin, with the carbonyl group 
of G352 forming the capping interaction with the N-terminal amine 
groups of helix 8 of rhodopsin (Fig. 3c). The cytoplasmic side of the 
rhodopsin transmembrane bundle is highly positively charged and 
this is known to be a common feature of GPCR structures”. Sequence 
alignment of all Ga; and Go, subtypes reveals a conserved sequence 
pattern (Fig. 3d), including D350 and G352, which is at the terminus 
of the a5-helix and allows the uncapped carbonyl group of a5-helix 
to form charge interactions with helix 8 of rhodopsin (Fig. 3b). The 
G352C-mutated Ga, did not crosslink with rhodopsin with cysteine 
mutations around the transmembrane bundle (Extended Data Fig. 4a, 
b), consistent with the finding that the G352C mutation in G; disrupts 
its binding capability to rhodopsin. 

Beside the charge interactions, the two large hydrophobic side chains 
L353 and L348 are directed towards the hydrophobic pocket of rhodop- 
sin formed by TM3, TM5, TM6 and TM7 (Fig. 3a, b). Replacement of 
these two residues by alanine, together with G352A, has marked effects 
on G; binding to rhodopsin*®. Residues 1344, K345 and C351 of Ga; 
are also within packing distance with TM3, TM5 and TM6 (Fig. 3a, b), 
and alanine mutations in these residues also show reduced G; binding 
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Fig. 3 | The rhodopsin-G; interface. a, b, Two views of the binding 
interface between the «5-helix of the Ga; Ras-like domain and the TMD 
cavity of rhodopsin. c, Rendering of electrostatic surfaces involved in 
interaction of rhodopsin and Goj, with blue for positively charged regions 
and red for negatively charged regions. d, Sequence alignment of the last 
11 residues of the a5-helix from different G proteins, with key residues in 
receptor binding highlighted by colour shading. 


to rhodopsin**. Thus, our structural studies provide a structural expla- 
nation to rationalize the extensive mutagenesis studies on the effect of 
the last 11 residues of the C terminus of Ga;*°. 

The second and less extensive interface between rhodopsin and G; 
is mediated through the N-terminal helix (aN-helix) of Gaj, where 
its residues (E28 and R32) are in proximity to interact with the intra- 
cellular loop (ICL2) between TM3 and TM4 (Extended Data Fig. 5c). 
In contrast to a short a-helix formed by the ICL2 loop in the arres- 
tin-bound rhodopsin structure”’, ICL2 in the G;-bound complex adopts 
an extended loop. The presence of the interface between ICL2 and 
aN-helix was confirmed by site-specific disulfide crosslinking from 
cysteine mutations in residue E28 in Ga; to residues N145!*? and 
F146!” in ICL2 of rhodopsin (Extended Data Fig. 4c, d), in agreement 
with previous crosslinking results between Go, and light-activated rho- 
dopsin by a chemically activated crosslinking reagent*”. Furthermore, 
R32A in Gq, is a rare mutation that increases the binding of Ga; to 
thodopsin**. On the basis of the structure, the large side chain of R32 
in Gq, is in the position that could interfere with the close interactions 


Cytoplasmic view 


Fig. 4 | Structural comparison of G;-bound rhodopsin with inactive 
rhodopsin, arrestin-bound rhodopsin, and GaCT-bound rhodopsin. 
a, Side and cytoplasmic views of the G;-bound transmembrane bundle 
(orange) in superposition to the inactive rhodopsin (PDB code 1U19, 
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between the aN-helix in Ga; and the ICL2 of rhodopsin and the 
removal of the large side chain in R32A mutations could thus enhance 
rhodopsin-Gi interactions. 

Crystal structures of rhodopsin have been determined in several 
different states, including the inactive/inverse agonist-bound state® 
and the active arrestin-bound or GaCT-bound state!®!? (GaCT is a 
high-affinity peptide from the C terminus of Ga;). Comparisons of 
these structures reveal intriguing conformational differences in rho- 
dopsin between each state (Fig. 4a). Compared to the inactive state, 
the G;-bound rhodopsin structure has an extended TM5 and outward 
movements in TM6, TM1 and TM4 at the cytoplasmic side (Fig. 4a, 
Supplementary Video 1), which results in an elastic pocket for bind- 
ing the G protein. The overall structure of rhodopsin bound to G; is 
similar to that bound to the GaCT, consistent with the equivalence 
of rhodopsin coupling with G; and G, (Fig. 4b). The conformation of 
the C-terminal 11 residues overlaps well with the structure of GaCT. 
However, the position of the GaCT is shifted a half-helical turn 
(approximately 2.2 A) deeper into the rhodopsin helical bundle owing 
to the lack of constrain on the GaCT peptide (Fig. 4b). 

Structural comparison of rhodopsin in the G;-bound and arres- 
tin-bound states also reveals notable differences in the receptor 
conformation, including TM1, TM4, TM6 and helix 8 (Fig. 4a, 
Supplementary Video 2), as well as ICL2, which adopts a short a-helix 
in arrestin-bound rhodopsin but exists as an extended loop in the G;- 
bound state (Fig. 4a). These conformational differences may represent 
the unique structural signatures of rhodopsin to distinguish between 
G protein and arrestin. In addition, other factors such as phosphoryla- 
tion at the rhodopsin C-terminal tail may further enhance the specific 
interactions of arrestin with the activated receptor™*. 


Structural basis of G; and G, selectivity 

Rhodopsin and BAR are the two best studied GPCRs that are coupled 
to G; and G,, respectively. Structural comparisons between these two 
receptors reveal the basis for selective coupling of G, and G, by a GPCR. 
In the inactive state, both receptors adopt a very similar closed confor- 
mation, in which the cytoplasmic ends of TM6, TM1, TM2, TM4 and 
the turn between TM7 and helix 8 are in nearly identical positions to 
each other (Fig. 5a, b). In the active state, the cytoplasmic ends of TM2 
and TM4 of both receptors remain in identical position, but the cyto- 
plasmic ends of TM6, TM1 and the turn between TM7 and helix 8 differ 
considerably between the two receptors (Fig. 5c, d, Supplementary 
Video 3). In particular, the cytoplasmic end of TM6 in rhodopsin is 
shifted inward by approximately 8 A relative to that of TM6 seen in 
the G,-bound 8,AR complex or in the G,-bound class B GPCR struc- 
tures (Extended Data Fig. 5a—-d). Correspondingly, the a5-helix of 
Gq, is rotated 20° away from TM6 towards helix 8 and forms a cap- 
ping interaction with the N-terminal dipole of helix 8 (Fig. 3a, b). 


Peptide complex 
G, complex 


cyan) and arrestin-bound rhodopsin (PDB code 4ZWJ, green). 

b, Superposition of G;-bound rhodopsin (orange) with GaCT-bound 
rhodopsin (yellow). Differences in transmembrane domains at the 
cytoplasmic faces are highlighted. 
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GB/Gy from G, complex 
GB/Gy from G, complex 


Fig. 5 | Structural comparison of G;-bound rhodopsin with G,-bound 
G2AR. a, b, Side view of the inactive rhodopsin structure (PDB code 1U19, 
cyan) superposed with inactive B2AR (PDB code 2RH1, pink). c, d, Side 
and cytoplasmic views of G;-bound rhodopsin compared to G,-bound 
BAR (PDB code 3SN6, blue). Notable structural changes are seen for the 
intracellular domains of TM6 with a difference of approximately 8 A at 

the cytoplasmic end of the helix. The a5 helix of Ga; (green) is rotated 20° 
away from TM6 compared to that of Ga; (purple). As indicated, there are 
differences in the locations of the a-helical domains (AHD) of these two G 
proteins. e, f, Illustration of the 20° rotation of Ga; (e) and the 16 A shift in 
Bry as compared to the structure of G, (f). 


Compared to the straight «5-helix in Gaj, the a5-helix in Ga, is slightly 
kinked (Fig. 5d), which allows the C terminus of this helix to orient 
away from helix 8 towards TM6 of rhodopsin. Accompanying the 20° 
rotation of the a5-helix between Go; and Ga, is a rigid body rotation 
of the G; heterotrimer, which displays rearrangements up to 16A in 
the areas of the Ga; N-terminal end and Gy subunits (Fig. 5e, f). The 
observed differences between binding of G; and G, to their receptors 
are in good agreement with a model of the rhodopsin-Gi complex 
that resulted from experimental DEER distance mapping”*®. On the 
basis of these structural observations, we reason that the conforma- 
tional difference in the TM6 between rhodopsin and B,AR is one of the 
major determinants for the coupling specificity of G; and G, in the two 
receptors. This movement of TM6 upon activation is a general theme 
in seven-helix transmembrane receptors, and was originally observed 
in cryo-EM studies of light-induced conformational changes in the 
proton pump bacteriorhodopsin*®®*”. 

An additional cause of the coupling specificity of G, and G, between 
rhodopsin and 8,AR may reside in the regions of the TM5-a4-helix 
interface (Extended Data Fig. 5e, f). Superposition of G, and G; in the 
two receptor complexes reveals a steric collision between the extended 
TM5 of BAR and a4-helix of Ga;, consistent with the involvement of 


this region in receptor coupling specificity#"". 
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Structural comparison of our nucleotide-free G, complex with the 
GDP-bound inactive G; complex also reveals an interesting mechanism 
of rhodopsin-mediated G; activation (Extended Data Fig. 6a). Upon 
binding to rhodopsin, the a5-helix was rotated by 90° and extended 
by two additional helical turns into the rhodopsin cytoplasmic pocket 
(Extended Data Fig. 6b), which resulted in conformational changes in 
the loop between (6 and a5-helix, a key structural element of the GDP- 
binding pocket. This conformational change leads to a movement of 
a1 -helix into the GDP-binding pocket, thus expelling the GDP from 
its pocket (Extended Data Fig. 6c, d). The release of GDP from Ga; 
induces separation of the Ga; Ras domain from its AHD, the con- 
formation of which was captured by the Fab fragment as observed in 
our structure. The mechanism of rhodopsin-mediated G; activation 
is similar to that of 8,AR-mediated G, activation'**’, suggesting a 
common mechanism of GPCR-mediated G protein activation. 

To gain further insight into differential conformational dynamics 
of TM6 between active rhodopsin and 3,AR, we performed all-atom 
mollified adaptive biasing potential (mABP) simulations of the recep- 
tors with the fABMACS software package to accelerate sampling“ 
(Extended Data Table 2). By calculating the free energy landscape 
of TM6 movement relative to the receptor transmembrane bundle 
(Extended Data Fig. 7a, b), these biased simulations revealed that the 
dynamic range of TM6 of rhodopsin is considerably less than that 
of B2AR (Fig. 6a—c). Furthermore, the TM6 of rhodopsin was una- 
ble to swing outwards to sample conformations comparable to that 
of BAR, resulting in steric clashes between rhodopsin TM6 and G,, 
consistent with the inability of rhodopsin to couple with G, (Fig. 6c 
and Extended Data Fig. 1b). To extend this mechanism to other class A 
GPCRs, we performed similar simulations for the ,t-opioid receptor 1 
(wOR1)*, a Gj-coupled receptor, and the adenosine Az, receptor 
(AzaR)”°, a G,-coupled receptor. These simulations also revealed that 
TM6 of ORI remained in a more closed conformation, whereas TM6 
of AzR favoured an outward conformation (Fig. 6a—c and Extended 
Data Fig. 7c, d). We note however that both G, coupler simulations 
were performed in the absence of a complete ICL3, whereas both G; 
couplers had their ICL3 intact. Additional simulations with ICL3- 
truncated G; couplers revealed no notable differences compared to the 
full-length receptor (Fig. 6b), indicating that their ICL3 was not suffi- 
cient to restrict the outward movement of TM6. Together, these results 
suggest that conformational differences in TM6 may represent the basis 
for stratification of GPCRs into distinct conformational groups with 
respect to the coupling specificity for G; or G; by GPCRs“ (Fig. 6d). 


TM6 sequence motif for G; selectivity 

To investigate the basis for the different TM6 dynamics between G; 
and G, coupling GPCRs, we performed sequence analysis on TM6, 
which reveals that G;- and G,-coupled receptors exhibit distinct, 
inversely related enrichment patterns for polar and hydrophobic res- 
idues at the membrane interface (Extended Data Fig. 8a—c). For Gi- 
coupled receptors, an enrichment of polar, often positively charged, 
residues at TM6.31/34/35 could act to stabilize the receptor within the 
charged lipid head groups thus preventing outward movement of TM6 
(Extended Data Fig. 8c). By contrast, an enrichment of hydrophobic 
residues found in G, coupling receptors would promote an outward 
swing of TM6 favouring interactions with the hydrophobic lipid tails. In 
addition, we found another potential selectivity filter at TM6.36 where 
G; and G, couplers exhibit differential enrichment of polar/hydropho- 
bic residues. Comparison of our Gj-bound rhodopsin to Gs-bound 
BAR and miniG,-bound AzaR shows that M253°”* of rhodopsin forms 
a hydrophobic interaction with L353 of Ga;, whereas T274°** of 8,AR 
and $234°*6 of AzaR do not appear to interact with the G, or miniG, 
(Extended Data Fig. 8d-f). Instead these polar residues in G,-coupled 
receptors are enriched near an extra kink specifically in the TM6 helix 
of G,-coupled receptors. We questioned whether they may have a role 
to destabilize TM6 and promote its outward kink. Indeed, throughout 
simulations, both $234 and T274 form extensive hydrogen bonding 
with the kinked TM6 backbone of 82AR and A»4R (Extended Data 
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Fig. 6 | TM6 dynamics of G;- and G,-coupled receptors. a, G;-coupled 
receptors exhibit a markedly constrained range of motion compared to 
G,-coupled receptors. Comparison of overlapped free energy landscapes 
truncated at 10kJ mol“! for G;- and G,-coupling GPCRs plotted with their 
collective variables used for mABP simulations. CV1 and CV2, collective 
variables 1 and 2, respectively. b, Weighted TM4.40 to TM6.27 C, distance 
distributions for full-length (simulations 1 and 2) and ICL3-truncated 
receptor simulations (simulations 3-5). An asterisk on full-length 
indicates that ICL3 was absent in both G, couplers. Reference distances for 
Gj- and G,-bound states (shown as black vertical dashed lines) are taken 


Fig. 8g). We speculate that the polar/non-polar residue distribution in 
TM6 may contribute to differential conformational dynamics of TM6 
between G;- and G,-coupled receptors. 

In summary, our results show that the conformational change, espe- 
cially the outward movement of TM6, is less pronounced for G;-bound 
rhodopsin than the corresponding movement in the G,-bound 82AR 
structure. This conformational difference seems to be the key deter- 
minant for the different docking of the C-terminal «5-helix between 
Go; and Ga, into the receptor transmembrane bundle. Energy land- 
scape analysis of a different set of Gi- and G,-coupled receptors also 
revealed that they have similar profiles to those of rhodopsin and 3,AR, 
respectively, with respect to the position of TM6. Sequence analysis 
reveals that the differential swing of TM6 between G;- and G,-coupled 
receptors could be attributed to different polar/nonpolar residue dis- 
tribution in the TM6. Together, these data suggest that the basis for 
G,- and G,-coupling selectivity observed in rhodopsin and 8,AR is a 
general theme for GPCRs to distinguish between G; and G, receptors. 
Gj-coupled receptors represent the largest subgroup of receptors in the 
GPCR superfamily. On the basis of the common features of the pos- 
itively charged transmembrane bundle of GPCRs and the conserved 
sequences of the a5-helix in Ga, we expect that the structure of the 
rhodopsin-G; complex will serve as a model for understanding signal- 
ling of other G;-coupled receptors. 
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from the structure reported here and the 8,AR-G, protein complex (PDB 
code 3SN6). For G;-coupled receptors with and without a complete ICL3, 
the outward movement of TM6 is approximately 6 A less than that of G, 
couplers. c, Representative snapshots of the most probable TM6 position 
taken from the first simulation replicate, overlapped with the 8,AR-G, 
crystal structure. Steric clashed can be seen between the TM6 of both G; 
couplers and G,. d, Schematic depicting alternative TM6 conformational 
states as the structural determinants for selective coupling of G, and 

G,. TM6 distance ranges were calculated using the structure of inactive 
rhodopsin (PDB code 1F88) as a fixed reference point. 


are available in the online version of the paper at https://doi.org/10.1038/s41586- 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized, and investigators were not blinded to allocation during 
experiments and outcome assessment. 

Constructs and expression of human rhodopsin. Human rhodopsin with four 
mutations, N2N'™C, E113378Q, M257°4°Y and N282®13¢, was cloned into 
pFastBac vector. To facilitate expression and purification, an N-terminal 8 x His- 
sfGFP-BRIL epitope and a TEV protease site were inserted after sfGFP. These 
constructs were expressed in Sf9 insect cells using the Bac-to-Bac Baculovirus 
system (Invitrogen). The cells were infected with baculovirus at 27°C for 48h 
before collection. 

Constructs, expression and purification of G; heterotrimer. The heterotrimeirc 
G; complex was expressed in Sf9 insect cells (Invitrogen). Human Go, was cloned 
in pFastbac vector with 6x His-MBP at the N terminus, and the virus was pre- 
pared using Bac-to-Bac system (Invitrogen). Rat G81 and N-terminal 6 x His-MBP 
tagged bovine G~2 were cloned into a pFastBac vector. The virus was prepared 
using the Bac-to-Bac system. The cells were infected with both Ga; and GBy virus 
at 27°C for 48h before collection, and the ratio was determined by small-scale 
titration experiment. The G; heterotrimer was purified as previously described”°. 
DEER spectroscopy. Generation of rhodopsin mutants, expression in HEK293S 
GnTI- cells, and spin-labelling with 1-oxyl-2,2,5,5-tetramethyl-A?-pyrroline- 
3-methyl) Methanethiosulfonate (MTSSL, Toronto Research Chemicals) was 
as previously described*”. For DEER measurements, spin-labelled rhodopsin 
mutants bound to a 1D4-antibody (MA1-722, ThermoFisher)-conjugated 
column were thoroughly washed and complexed with G; heterotrimer or with 
the G;-Fab_G50 complex in 20mM HEPES, pH 7.2, 100mM NaCl, 0.02% 
n-dodecyl-B-p-maltopyranoside (DDM), 0.004% cholesteryl hemisuccinate (CHS) 
and 541M ATR under yellow (>500 nm) light illumination on ice. The entire rho- 
dopsin-G; or rhodopsin—Gi-Fab_G50 complex was then further washed and 
eluted from the 1D4 column with 20 mM HEPES, pH 7.2, 100 mM NaCl, 0.1% 
digitonin, 150\1M 1D4 peptide and 541M ATR. The eluate was concentrated, and 
20% (v/v) deuterated glycerol was added. The complexes were added to quartz 
capillaries (1.5mm inner diameter and 1.8 mm outer diameter) and flash-frozen 
using a dry ice/ethanol bath. The capillaries were loaded into an EN 5107D2 reso- 
nator, and Q-band measurements were performed at 80 K on a Bruker Elexsys 580 
spectrometer with a Super Q-FTu Bridge. For the four-pulse DEER experiment 
used here, a 32-ns 1-pump pulse was applied to the low field peak of the nitrox- 
ide field-swept spectrum, and the observer 7/2 (16 ns) and x (32 ns) pulses were 
positioned 50 MHz (17.8G) upfield, which corresponds to the nitroxide centre 
line. Distance distributions were obtained from the raw dipolar evolution data 
using the LabVIEW (National Instruments) ‘LongDistances’ program, developed 
by C. Altenbach, which can be downloaded from http://www.biochemistry.ucla. 
edu/Faculty/Hubbell/. 

Phage display selections. Avi-tagged heterotrimeric G, was used for biopanning 
during selection. The G; construct has an Avitag on the N-terminal end of the 
gamma subunit (G+). The Avitag was biotinylated in vivo in Sf9 cells by co-expression 
of the biotin ligase BirA in presence of supplemented p-biotin. Pull-down exper- 
iments on Streptavidin magnetic beads showed quantitative biotinylation of 
the trimeric complex used in selection. Phage display selection was performed 
according to published protocols“. In brief, in the first round, 200nM of target 
was immobilized on 250 1] magnetic beads. Then, 100 11 of a phage library”? con- 
taining 10'*-10"? virions were added to the Streptavidin beads and incubated for 
30 min. The resuspended beads containing bound virions were washed exten- 
sively and then used to infect freshly grown log phase Escherichia coli XL1-Blue 
cells. Phages were amplified overnight in 2xYT media (Fisher Scientific) with 
50g ml! ampicillin and 10° p.f.u. ml~! of M13-KO7 helper phage. To increase 
the stringency of selection, three additional rounds of sorting were performed 
with decreasing the target concentration in each round (second round: 50nM, 
third round: 10nM and fourth round: 10 and 5nM) using the amplified pool of 
virions of the preceding round as the input. Sorting from the second to fourth 
rounds was done on a Kingfisher instrument. From the second to fourth rounds, 
the targets were premixed with the amplified phage pool and then Streptavidin 
beads were added to the mixture. From the second round onwards, the bound 
phages were eluted using 0.1 M glycine, pH 2.7. This technique often risks the 
elution of non-specific and Streptavidin binders, which tend to overpopulate the 
amplified phage pool thereby reducing the chance to obtain the desired specific 
clones. To eliminate them, the precipitated virions from the second round onwards 
were negatively selected against 100 11 of Streptavidin beads. The pre-cleared phage 
pool was then used as an input for the selection. 

Single-point phage ELISA. All ELISA experiments were performed in 96-well 
plates coated with 501 of 2jg ml~! neutravidin in NayCO; buffer, pH 9.6 and 
subsequently blocked by 0.5% BSA in PBS. A single-point phage ELISA was used to 
rapidly screen the binding of the obtained Fab fragments in phage format. Colonies 
of E. coli XL1-Blue harbouring phagemids were inoculated directly into 500 il 
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of 2xYT broth supplemented with 100,.g ml! ampicillin and M13-KO7 helper 
phage. The cultures were grown at 37°C for 16-20h at 280 r.p.m. in a 96-deep-well 
block plate. Culture supernatants containing Fab phage were diluted tenfold in 
PBST buffer. After 15 min of incubation, the mixtures were transferred to ELISA 
plates that were incubated with 50 nM biotinylated trimeric Gj; in experimental 
wells and with buffer in control wells for 15 min. The ELISA plates were incubated 
with the phage for another 15 min and then washed with PBST. The washed ELISA 
plates were incubated with horseradish perixoidase (HRP)-conjugated anti-M13 
mouse monoclonal antibody (ab50370, Abcam,1:5,000 dilution in PBST) for 
30 min. The plates were again washed, developed with TMB substrate and then 
quenched with 1.0 M HCl, and the absorbance at 450 nm was determined. The 
background binding of the phage was monitored by the absorbance from the con- 
trol wells. 

Sequencing, cloning, overexpression and purification of Fab fragments. From 
phage ELISA, clones (selected based on a high ratio of ELISA signal of target 
binding to background) were sequenced at the DNA Sequencing Facility at the 
University of Chicago. Unique clones were sub-cloned in pRH2.2, an IPTG inducible 
vector for expression of Fabs in E. coli. E. coli BL21 (Gold) cells were transformed 
with sequence-verified clones of Fab fragments in pRH2.2. Fab fragments were 
grown in 2xYT media with 100,1g ml! ampicillin at 37°C for 2-2.5h, during 
which A¢o0 nm reached 0.6-0.8, induced with 1 mM IPTG and grown for a further 
4.5h at 37°C. Harvested cells were kept frozen at —80°C until use. Frozen pellets 
were re-suspended in PBS supplemented with 1 mM PMSF, and 1g ml! DNase I. 
The suspension was lysed by ultrasonication. The cell lysate was incubated at 65°C 
for 30 min to eliminate of any undesired proteolysed fragments of the Fab produced 
during overexpression. Heat-treated lysate was then cleared by centrifugation, 
filtered through 0.22\.m filter and loaded onto a HiTrap MabSelect SuRe 5-mL 
column pre-equilibrated with lysis buffer (20 mM phosphate buffer, pH 7.5, 
500 mM NaC)). The column was washed with 10 column volumes of lysis buffer 
followed by elution of Fab fragments with elution buffer (0.1 M acetic acid). 
Fractions containing protein were directly loaded onto a Resource S 1-mL 
column pre-equilibrated with buffer A (50mM sodium acetate, pH 5.0) followed 
by washing with 10 column volumes wash with buffer A. Fab fragments were 
eluted with a linear gradient 0-50% of buffer B (50mM sodium acetate, pH 5.0, 
2.0M NaC]). Affinity and ion-exchange chromatography were performed using 
an automated program on AKTA explorer system. Purified Fab fragments were 
dialysed overnight against 20 mM HEPES, pH 7.4, 150mM NaCl. The quality of 
purified Fab fragments was analysed by SDS-PAGE. 

Rhodopsin-G;-Fab complex formation and purification. Sf9 cell pellets infected 
with virus containing rhodopsin were lysed in 20mM HEPES, pH 7.2, 10mM 
NaCl, and 10mM MgCl). The supernatant was centrifuged at 160,000g for 30 min 
to collect the membranes. The membranes were washed by homogenization in 
20mM HEPES, pH 7.2, 1 M NaCl and 10mM MgCh, and then was collected by 
centrifugation at 160,000g¢ for 30 min. 

The rhodopsin-G; complex was formed in membranes as described previ- 
ously!®. The washed membranes were re-suspended in 20mM HEPES, pH 7.2, 
100 mM NaCl, 10% glycerol, and 541M ATR. For 11 of rhodopsin cell pellets, 6mg 
of G; 10 mg Fab_G50 and 1 U of apyrase were added. The sample was incubated 
overnight at 4°C. The membranes were then solubilized in 20 mM HEPES, pH 
7.2, 100mM NaCl, 10% glycerol, 0.5% DDM (Anatrace), 0.1% cholesteryl hem- 
isuccinate (CHS), and 51M ATR for 2h at 4°C. The supernatant was isolated by 
centrifugation at 160,000g for 1h, and then was incubated with TALON IMAC 
resin (Clontech) for 3h at 4°C. After binding, the resin was washed with 10 col- 
umn volumes of 20mM HEPES, pH 7.2, 100 mM NaCl, 0.02% DDM, 0.004 CHS, 
50 mM imidazole and 5\1M ATR. The buffer was exchanged to 20 mM HEPES, pH 
7.2, 100mM NaCl, 0.1% digitonin and 541M ATR. The protein was then treated 
overnight with His-tagged TEV protease (made in-house) on column. The complex 
sample was eluted with 20mM HEPES, pH 7.2, 100mM NaCl, 0.1% digitonin, 
50mM imidazole and 51M ATR. The rhodopsin-G;-Fab complex sample was 
concentrated and loaded onto Superdex $200 10/300 GL column with running 
buffer 20mM HEPES, pH 7.2, 100mM NaCl, 0.1% digitonin and 511M ATR; the 
fractions for the monomeric complex were collected and concentrated individually 
for electron microscopy experiments. 

Negative-stain analysis of rhodopsin-G;-Fab complex. Protein samples were 
applied to a freshly glow-discharged carbon coated copper grid and allowed to 
adhere for 10s before being reduced to a thin film by blotting. Immediately after 
blotting, 3 jl of a 1% solution of uranyl formate was applied to the grid and blotted 
off directly. This was repeated three times. Data were acquired using a Tecnai 
Spirit transmission electron microscope operating at 120 kV. Images were pro- 
cessed using Relion 2.1°°. 

Cryo-EM data acquisition. A droplet (2.7511) of purified rhodopsin-G,-Fab com- 
plex at a concentration of about 9 mg ml”! was applied to a glow-discharged holey 
carbon grid (Quantifoil R1.2/1.3, Au 300 mesh), and subsequently vitrified using a 
Vitrobot Mark IV (FEI Company). Cryo-EM data were collected on a Titan Krios 
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microscope using a K2 camera positioned post a GIF quantum energy filter, with 
a slit width of 20 eV. Micrographs were recorded in super-resolution mode at a 
magnified physical pixel size of 1.074 A, with defocus values ranging from — 1.3 to 
—3.0,m. The total exposure time was set to 6s with intermediate frames recorded 
every 0.25, resulting in an accumulated dose of about 60 electrons per A? and a 
total of 30 frames per movie stack. 

Image processing and structural refinement. In the initial phase of processing, 
unbinned image stacks from 19,368 K2 movies were corrected for drift and for 
beam-induced motion by alignment using cross-correlation as implemented in 
Unblur®!. Dose-corrected integrated frames were used for subsequent image pro- 
cessing. Particles were automatically picked using RELION 2.1° using 6 projection 
image references, generated by applying 2D classification on 1,000 manually picked 
particles. CTFFIND4 was used for CTF estimation®. Next, a set of 1.65 million 
particles were extracted from the 14,464 integrated frames displaying Thon rings 
extending beyond 4.5 A. The extracted particle images were normalized, and sub- 
jected to 50 rounds of both iterative 2D classification (regularization parameter 
T=2) and 3D classification (T= 4). Uninterpretable, sparsely populated, or poorly 
defined classes were discarded at both stages leaving behind approximately 227k 
particles for further 3D processing (Extended Data Fig. 3). A coarse initial model 
was generated using the 3D initial model generation module in RELION and the 
model was refined until convergence was achieved. Density maps were corrected 
with a B-factor of —217 and ‘gold-standard’ Fourier shell correlation (FSC) resolu- 
tion plots were calculated with a soft shape mask applied to independent, unfiltered 
half-maps resulting from the processing. 

The crystal structure of human rhodopsin (PDB code 4ZW)J) and G protein 
complex (PDB code 1GG2) were used as initial models for model rebuilding and 
refinement against the electron microscopy map. All models were docked into the 
electron microscopy density map using Chimera’®, followed by iterative manual 
adjustment in COOT™, fragment-based refinement with Rosetta°’, and real space 
refinement using Phenix programs”*. The model statistics was validated using 
MolProbity*”. Structural figures were prepared in Chimera and PyMOL (https:// 
pymol.org/2/). The final refinement statistics are provided in Supplementary 
Table 3. The extent of any model overfitting during refinement was measured 
by refining the final model against one of the half-maps and by comparing the 
resulting map versus model FSC curves with the two half-maps and the full model. 
In-cell disulfide bond cross-linking. The open-reading frames of mini-Gi with 
an N-terminal Flag tag and full-length rhodopsin with a C-terminal haemagglu- 
tinin (HA) tag were cloned into pcDNA6. Cysteine mutations (E28C for miniG; 
and N145C or F146C for rhodopsin) were systematically introduced in these two 
DNA vectors. AD293 cells were split 1 day before transfection at 50,000 cells per 
well in a 24-well plate. Cells were grown for 1 day, then transfected with 100 ng 
rhodopsin constructs (pcDNA6-rho-3HA) plus 100 ng G; plasmid (pcDNA6- 
3xFlag-miniGi), 100 ng pcDNA6-G8, and 100 ng pcDNA6-G7 by Lipofectamine 
2000 (DNA:Lipofectamine 2000 ratio of 1:2) in each well. Cells were grown for 
2 days after transfection, and were then treated at room temperature with H2O>, 
which was freshly diluted in the cell culture medium to a final concentration of 
1mM. After 5 min of treatment with H2O2, the medium was aspirated and 100, 
of CelLytic M (Sigma C2978) were added to each well and the plate was shaken for 
10 min at room temperature. Cell lysates were transferred to a 1.5 ml tubes, spun 
at 18,000g at 4°C for 5 min. The supernatants (10 11) were mixed with an equal 
volume of 2x SDS loading buffer (without reducing agents) for 5 min at room tem- 
perature, and loaded onto a protein gel for western blot analysis. HRP-conjugated 
anti-Flag (A8592, Sigma) and anti-HA (H3663, Sigma) antibodies were used to 
probe for free and cross-linked miniGi and rhodopsin proteins. 

Ga;-mediated ERK activation assay and cAMP accumulation assay. AD293 
cells were plated at a density of 5 x 10* per well in 24-well plates 1 day before 
transfection. Cells were then transiently transfected using Lipofectamine 2000 
reagent (Life Technologies) with 50ng cDNA encoding GPCR, 200 ng luciferase 
reporter construct containing an ERK response element (SRE) or cAMP response 
element (CRE), and 10ng TK-Renilla at a Lipofectamine 2000 reagent:DNA ratio of 
2:1. ERK activation or cAMP accumulation was detected using the Dual-luciferase 
reporter assay system from Promega according to the manufacturer's instructions 
using an EnVision plate reader (Perkin Elmer). Renilla luciferase was used for 
normalization. All experiments were performed in triplicate, with each well trans- 
fected independently. 

Molecular dynamics simulation setup and equilibration. Molecular dynamics 
simulations of two G; couplers, rhodopsin (PDB code 3PQR) and mOR1 (PDB 
code 5C1M), and two G, couplers, 3.AR (PDB code 4LDE) and A24R (PDB code 
5G53), were initiated from structures in the active (S4) conformation as described 
previously>®. All receptors selected met the following inclusion criteria: a fully 
active, agonist-bound receptor in complex with a peptide, nanobody, or engineered 
G protein, and free of any ICL3 fusion that may influence the conformational state 
of the receptor. 


Each receptor was prepared for simulations as follows. Crystallization part- 
ners and heteroatoms, with exception of agonists bound within the orthosteric 
binding site and palmitoylated cysteine residues on helix 8, were removed. 
Thermostabilizing and non-native mutations were reverted back to their wild-type 
residue and missing residues not within ICL3 were modelled and subjected to 5,000 
rounds of ‘very_slow loop refinement assed by DOPE scoring using Modeller 
9.17°°. Palmitoyl groups on helix 8 were then added and protonation states for 
active receptors were assigned based on previous publications**"!. The resulting 
complexes were capped with neutral acetyl and methylamine groups and embed- 
ded into a pre-equilbirated palmitoyl-oleoyl-phosphatidylcholine (POPC) lipid 
bilayer, solvated in a box of TIP3P waters allowing for 14 A of padding on all sides 
with 150mM NaCl, and neutrailized by removing appropriate ions or counter ions 
using the Desmond system builder within Maestro (Schrédinger Release 2018-1: 
Maestro, Schrédinger, LLC, New York, 2018). Full details for each system can be 
found in Supplementary Table 1. 

All-atom atmospheric simulations were performed using GROMACS5.0.6 
with the CHARMM36m force field and periodic boundary conditions®. Ligand 
parameters were generated by SwissParam®. Before production simulations, 50,000 
steps of energy minimization were performed, followed by equilibration in the 
canonical (NVT) and isothermal-isobaric (NPT) ensembles for 10 and 50 ns, 
respectively, with positional restraints (1,000 kJ mol~! nm~) placed on backbone 
atoms. Temperature was maintained at 310 K using the v-rescale method with a 
coupling time of 0.1 ps and pressure was maintained at 1 bar using the Berendsen 
barostat with a coupling time (t)) of 1.0 ps and compressibility of 4.5 x 107° bar! 
with a 2 fs timestep®. 

Mollified adaptive biasing potential simulations. To calculate the free energy 
landscape of TM6 movement relative to the transmembrane bundle of the receptor, 
we used mABP with overfill protection using fABMACS®*, A centre-of-geom- 
etry (COG) distance collective variable was introduced into fABMACS and two 
collective variables were defined: COG distance between select TM6 (6.27-6.42) 
and TM1/2 (1.50-1.59 and 2.39-2.48) C, atoms, and COG distance between select 
TM6 and TM3/4 (3.45-3.55 and 4.39-4.47) C, atoms (Extended Data Fig. 7). 
The collective variable space was discretized into a 480 x 480 grid with ranges 
of 0 to 50A for a bin width of 0.10 A. Biasing parameters were b=0.8, c=0.01/ 
&t, a= 10, and a maximum fill level of 30 kJ mol~!. Simulation parameters for 
mABP simulations were the same as those used during NPT equilibration except 
the Parrinello-Rahman barostat with a 5.0 ps coupling time was used. To pre- 
vent unwanted transitions away from the active conformation of the receptors, 
the NPXXY tyrosine was held in an active conformation and TM7 was restrained 
using Urey-Bradley harmonic potentials between residues 7.42-7.53, in which 
angles and distances were derived from the equilibrated structure and residues i, j, 
k of the potential were i, i+ 2, i+4. Simulations without TM7 restraints saw quick 
transitions towards receptor inactivation, consistent with prior long time scale 
simulations® (data not shown). For simulations in which ICL3 was truncated or 
not resolved, backbone distance restraints were placed on the last four residues to 
prevent helical unwinding. Independent mABP production simulations for each 
receptor were run in duplicate for approximately 121s in total. Throughout the 
simulations, the instantaneous boost was recorded for each recorded frame allow- 
ing for the generation of weighted histograms. Simulation analysis was performed 
using MDTraj 1.7.2 and VMD 1.9.21, Plots were generated using the R statistical 
package. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. All data and source code are available upon request. 
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Extended Data Fig. 1 | Purification, characterization and cryo-EM 
images of the Rho-G;-Fab complex. a, Representative elution profile 

of the purified Rho—G,-Fab_G50 complex on Superdex 200 10/300 

gel filtration. b, SDS-PAGE analysis of the complex after gel filtration. 

c, The inability of rhodopsin to stimulate the G,-mediated signalling as 
assayed by the cAMP-driven luciferase reporter assays. The glucagon-like 
peptide 1 receptor (GLP-1R) shows stronger G,-meditated signalling with 
the agonist GLP-1 (n =3 independent experiments). Data are mean + s.d. 
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d, An overall view of rhodopsin showing the three intramolecular 
distances between two nitroxide N-O bonds based on the models of the 
R1 nitroxide pairs Y74R1-Q225R1, Y74R1-R252R1 and Y74R1-M308R1, 
respectively (Y74*41, Q225°-, R252°°, M3087; superscripts denote 
Ballesteros— Weinstein numbering). R1 side-chain modelling details have 
been described previously’. e, Similar DEER distance distributions of 
TM6 and TM7 to TM2 of rhodopsin bound to G; and G;. f, Time domain 
data of DEER measurements. 
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Extended Data Fig. 2 | Cryo-EM images and single-particle analysis refined against a half-map and compared to the two half-maps as well as 
of the Rho-G;,-Fab complex. a, Representative cryo-EM micrograph the full model. The r.m.s.d. between the model refined against half-map 
of Rho-G;-Fab complex. Examples of particle projections are circled. and compared to the full map, and the one refined against the full map 

b, Reference-free two-dimensional class averages of the complex in is 0.984 A, and their corresponding FSCs against the final map show a 
digitonin micelles. c, Half-map Fourier shell correlation (FSC) plots as resolution difference at the 0.5-cutoff of approximately 0.1 A. e, Particle 
produced by RELION with the mask used shown as an inset. d, FSC curve classification and refinement. f, Local resolution map of the rhodopsin-G, 
of model versus the full map, as well as FSC curves obtained for a model complex. 
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Extended Data Fig. 3 | Electron microscopy density map of rhodopsin- _ rhodopsin transmembrane helices and helix 8. e-g, An overall view of the 
G; complex. a—c, Three views of the electron microscopy density map of rhodoposin—Gq, interface (e), and electron microscopy density map for 
the rhodopsin-Gq, interface. d, Electron microscopy density map of all the TM6 of rhodopsin (f) and the a5-helix of Ga; (g). 
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Extended Data Fig. 4 | The rhodopsin-G; interface and disulfide with d; n= 3 independent experiments). c, Interactions at the interface 
crosslinking of rhodopsin with Ga;. a, The rhodopsin-G; interface between ICL2 of rhodopsin and aN helix of Gaj. The side chains are 
surrounding the G352 residue of Ga; «5-helix. Not all side chains shown not visible in the map but shown here for illustrating their C, positions. 
are visible in the map but shown here for illustrating their C, positions to d, Demonstration that E28C of Ga; can be disulfide cross-linked to 
facilitate understanding of data in panel b. b, Lack of disulfide crosslinking _ rhodopsin residues N145C'“ and F146C '“"? (n= 3 independent 

of G352C of G; with surrounding residues from rhodopsin (compare experiments). 
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Extended Data Fig. 5 | Structural comparison of G;-bound rhodopsin, and cytoplasmic views of G;-bound rhodopsin (orange) overlaid with 

G,-bound GLP-1R, and G,-bound CTR, and the role of «4-helix of G,-bound CTR (PDB code 5UZ7, grey). e, f, Side-by-side comparison 

Ga in receptor selectivity. a, b, Side and cytoplasmic views of G;-bound of the rhodopsin-G; complex (e) with the 8, AR-G, complex (f). 

rhodopsin (orange) overlaid with G,-bound GLP-1R (PDB code 5VAI, g. An overlay of the rhodopsin—G; complex with the B,AR-G, complex 

light blue, black arrows indicate differences in helix positions). c, d, Side reveals possible collision of TM5 of 82AR with a4-helix of Goi. 
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Extended Data Fig. 6 | The mechanism of rhodopsin-mediated G, a5-helix (b). c, d, Side-by-side comparison of the GDP-binding site of the 
activation. a, b, Superposition of the rhodopsin—G; complex with the Ga, Ras domain in the inactive GDP-bound Gq, (c) and nucleotide-free 
inactive GDP-bound G; (PDB code 1GG2) reveals separation of the AHD state Ga; with GDP added for comparison (d). 

from the Ras domain of Ga; (a) and conformational changes in the 
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atoms of TM6 to TM1/2 and TM6 to TM3/4 respectively. Collective 
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Extended Data Table 1 | Cryo-EM data collection, refinement and validation statistics 


Data collection and processing 
Magnification 
Voltage (kV) 
Electron exposure (e—/A’) 
Defocus range (uum) 
Pixel size (A) 
Symmetry imposed 
Initial particle images (no.) 
Final particle images (no.) 
Map resolution (A) 

FSC threshold 
Map resolution range (A) 


Refinement 
Initial model used (PDB code) 
Model resolution (A) 
FSC threshold 
Model resolution range (A) 
Map sharpening B factor (A?) 
Model composition 
Non-hydrogen atoms 
Protein residues 
Ligands 
B factors (A?) 
Protein 
Ligand 
R.m.s. deviations 
Bond lengths (A) 
Bond angles (°) 
Validation 
MolProbity score 
Clashscore 
Poor rotamers (%) 
Ramachandran plot 
Favored (%) 
Allowed (%) 
Disallowed (%) 


#1 Rho-Gi 
(EMDB-7517) 
(PDB 6CMO) 


45956 
300 

60 

-1.5 to -2.5 
1.074 

C1 
1656874 
227386 
4.5 

0.5 

3.6-5 
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Extended Data Table 2 | GPCR simulation systems used in the current study 
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Receptor : : y-opioid receptor 62 adrenergic adenosine A2A 
(Coupling) rhodopsin (Gut) 1 (Gi) receptor (Gs) receptor (Gs) 
PDB 3PQR (2.85 A) 5C1M (2.10 A) 4LDE (2.79 A) 5G53 (3.40 A) 
(resolution) " ; ° 7 
Crystallization : et 
Barnet Gt peptide nanobody nanobody mini Gs 
Ligand ATR (agonist) BU72 (agonist) Bl167107 (Agonist) NEC (agonist) 
77.2Ax77.2Ax 79.1Ax79.1Ax 81.1Ax81.1Ax 90.2Ax90.2Ax 
System Size 97.0 A; 10,610 93.3 A; 10,347 89.2 A; 10,198 89.7 A; 12,720 
y waters; 128 POPC; waters;135 POPC; waters; 143 POPC; waters; 191 POPC; 
54,353 atoms total 54,171 atoms total 54,557 atoms total 68,568 atoms total 
Protonated D83?°°, E113%28, aA Be a =i eA meh 26 
Residues — £122337, 134349 Pe D1303-49 pecee mid 
Palmitoylation C322, C323 C341 
Sim 1: 0.98 us 
Sim 1: 0.89 us Sim 2: 0.98 us eer pee oee 
Simulations Sim 2: 0.90 us Sim 3: 1.00 us Sate SS ee 
Sim 3: 1.00 us Sim 4: 1.00 us et aay 
Sim 5t 1.00 us 


Simulations marked with a superscript ‘t’ indicate truncation of a crystallographically resolved ICL3. 
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The class A adenosine A, receptor (A,R) is a G-protein-coupled receptor that preferentially couples to inhibitory G;/, 
heterotrimeric G proteins, has been implicated in numerous diseases, yet remains poorly targeted. Here we report 
the 3.6 A structure of the human A,R in complex with adenosine and heterotrimeric Giz protein determined by Volta 
phase plate cryo-electron microscopy. Compared to inactive A;R, there is contraction at the extracellular surface in the 
orthosteric binding site mediated via movement of transmembrane domains 1 and 2. At the intracellular surface, the 
G protein engages the A,R primarily via amino acids in the C terminus of the Ga; «5-helix, concomitant with a10.5 A 
outward movement of the A;R transmembrane domain 6. Comparison with the agonist-bound (32 adrenergic receptor- 
G,-protein complex reveals distinct orientations for each G-protein subtype upon engagement with its receptor. This 
active A;R structure provides molecular insights into receptor and G-protein selectivity. 


Adenosine (ADO) receptors comprise four subtypes within the class 
A G-protein-coupled receptor (GPCR) superfamily that mediate the 
actions of the purine nucleoside, ADO}. Activation of the AjR is thera- 
peutically desirable for ischaemia-reperfusion injury, atrial fibrillation, 
neuropathic pain and others!. Although ADO is used clinically to treat 
supraventricular tachycardia, the development of A;R-selective agonists 
for a broader range of disorders has thus far failed, primarily owing to 
dose-limiting on-target adverse effects*. Alternative approaches are 
thus necessary for improved A,R drug action, with studies focusing 
on the potential for greater A;R selectivity through targeting allosteric 
sites, or via development of A,R conformational state-selective biased 
agonists that can promote beneficial signalling while sparing pathways 
mediating on-target adverse effects*“. 

One area for the development of selective A; R-targeting drugs is 
the use of structure-based approaches that leverage advances in GPCR 
structural biology. Indeed, inactive-state, antagonist-bound, struc- 
tures of the A,R were solved using X-ray crystallography>*®. However, 
these studies required modification of the A;R via thermostabilizing 
mutations and/or fusion proteins, and cannot inform on mechanisms 
underlying agonist binding, A,R activation and G-protein interac- 
tion. These features are necessary for the rational design of selective 
AR activators, biased agonists or positive allosteric modulators. An 
alternative approach to overcoming the current dearth of active-state, 
G-protein-bound, GPCRs is the use of single-particle cryo-electron 
microscopy (cryo-EM)’°. The promise of cryo-EM in yielding active- 
state GPCR complexes has recently been realized through the solution 
of several receptor structures bound to agonists and the heterotrimeric 
G, protein, complementing the only crystal structure so far, to our 
knowledge, of an agonist-bound GPCR-G-protein complex, that of 
the 824R complexed to G, protein’~!°. However, the A,R preferentially 
couples to the Gi, family of G-proteins’. Indeed, of the more than 800 
human GPCRs, most preferentially couple to Gi/, proteins. The Gio 
family has four members, which are the most abundantly expressed 
G proteins throughout the body'!. Gi. protein activation is typically 


associated with inhibition of adenylate cyclase, resulting in reduced 
cAMP accumulation, but they also regulate numerous effectors includ- 
ing enzymes, ion channels and small GTPases. Based predominantly on 
the high expression of both Gjz proteins and A,Rs in brain and, albeit to 
a lesser degree, in cardiac tissues (both major organs for A;R therapies), 
we chose to focus on Gj as a transducer for the A;R. Here we report the 
first, to our knowledge, structure of a GPCR coupled to a heterotrimeric 
Gj protein, specifically the A;R-Gj2 complex bound to its endogenous 
agonist, ADO, solved using Volta phase-plate (VPP) cryo-EM. 


Solving the AyR-Gj. complex 

To facilitate complex formation, A;R and Giz were expressed sepa- 
rately in HighFive insect cells and combined after solubilizing in lauryl 
maltose-neopentyl glycol (LMNG) and cholesteryl hemisuccinate 
(CHS) with addition of apyrase and ADO (Extended Data Fig. 1). 
Stabilization of the A;R-Gj2 complex was achieved by introducing four 
Gai, subunit mutations that alter nucleotide binding and affinity for 
G6’. This dominant-negative Gi. (DNGjz) was sufficient to enable 
formation of a stable interaction with the receptor while insensitive to 
GTP (Extended Data Figs. 1 and 2c). The antagonist dipropylcyclopen- 
tylxanthine (DPCPX) displayed similar affinities for the AR whether 
alone or in the presence of wild-type Gajz or DNGiz (Extended Data 
Fig. 2a), whereas ADO displayed biphasic binding curves with a sim- 
ilar dispersion of high and low affinity states in the presence, but not 
absence, of either wild-type Gay or DNGiz (Extended Data Fig. 2b); 
a characteristic feature of agonist binding to many GPCRs". By con- 
trast, agonist-mediated [*°S]GTPS binding to activated Ga subunits 
was only observed upon combination of the AiR with wild-type Gaz 
(Extended Data Fig. 2c). 

The A;R-Gi complex in LMNG detergent micelles was visualized 
using a Titan Krios microscope equipped with a VPP. After imaging 
and initial 2D classification (Extended Data Fig. 3a, b), 3D classification 
yielded a final map at a nominal resolution of 3.6 A reconstructed from 
263,321 particle projections (Fig. 1, Extended Data Fig. 3c, Extended 
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Fig. 1 | The ADO-A,R-Gj. cryo-EM structure. a, Structure determined 
after refinement in the cryo-EM map (AjR, blue; ADO, purple; 
heterotrimeric Gy, pink, cyan and dark purple for a, 8 and 4, respectively). 


Data Table 1). The cryo-EM density map exhibited well-resolved side 
chains, allowing confident rotamer placements for most amino acids 
(Extended Data Figs. 3d and 4). The A,R transmembrane domain 
regions and extracellular and intracellular loops (ECL and ICL, respec- 
tively) are well defined, with the exception of 7 residues in ICL3 and the 
last 25 residues of helix 8. An ADO molecule is observed occupying 
the orthosteric site, and extra density at N159 indicates the presence 
of a glycosylation site that has a role in agonist binding; alanine sub- 
stitution of this residue caused a significant reduction in the affinity of 
the ADO derivative NECA (1-(6-amino-9H-purin-9-yl)-1-deoxy-N- 
ethyl-6-p-ribofuranuronamide) (Extended Data Fig. 2d-g). The G8 
and G7 subunits are also well resolved, except for their flexible N and 
C termini. The a-helical domain of Ga was averaged out in the 2D 
class averages owing to high flexibility and thus was masked out during 
map reconstruction, but the Ras-like domain is well ordered except for 
the flexible switch III region. 


Structure of the active-state AjR 

Notably, despite the presence of a different class of G protein relative to 
all other active GPCR structures solved so far, there seems to be a general 
conservation in activation mechanisms. Comparison of the inactive 
A,R bound to the covalent antagonist, DU172 (4-[3-(8-cyclohexyl- 
2,6-dioxo- 1-propyl-7H-purin-3-yl)propylcarbamoyl]benzenesulfonyl 
fluoride; Protein Data Bank (PDB) accession 5UEN)’, and the active 


Active A,R-ADO 
Inactive A,R-DU172 


Fig. 2 | Comparison of active and inactive A;R (PDB code 5UEN) 
structures. a—c, Side (a), extracellular (b) and cytoplasmic (c) views of 
the receptor (ADO-A,R-Gi2 complex in blue; inactive DU172-bound 
AR in grey). d, e, Active ADO-AjR (d) and inactive DU172-A\R (e) 
surfaces sliced to show binding site cavity. f, Orthosteric binding site of 
the active A;R-Gj. complex with ADO (purple ball and sticks). “Toggle 
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b, 3D cryo-EM map, coloured according to protein chains. c, 3D cryo-EM 
map coloured according to local resolution (A), defined from half-maps in 
Relion (v.2.1b1). 


A,R-Gy bound to ADO, reveals a global allosteric transition similar 
to that observed upon activation of the 8,AR bound to G,, or multiple 
class A active state GPCRs stabilized with nanobodies or a ‘mini-Ga, 
(mG,) protein!°13-!6 (Fig. 2a—c). A key characteristic of the A;R activat- 
ing transition is a large outward movement of the intracellular side of 
transmembrane helix 6 (TM6) by 10.5 A to accommodate the «5-helix 
of the Ga; protein (Fig. 2c). However, this is not as pronounced as in 
the G,-bound B,AR” (nor compared to the three class B G,-bound 
structures’~°), but similar to the change observed in the nanobody- 
stabilized 8,AR'®, My muscarinic acetylcholine receptor (M2R)!°, 
\-opioid receptor (,OR)" and «-opioid receptor (KOR)!” structures 
or the mG,-stabilized A,,R!°. Moreover, additional fundamental 
features of receptor activation previously observed in other (non-Gi/, 
preferring) class A GPCRs are also preserved, including rearrange- 
ments in conserved class A activation ‘microswitches; such as the PIF, 
NPXXY and DRY motifs’® (Fig. 2g), are very similar to the changes 
observed when comparing the active, agonist-bound, 8.AR-G, com- 
plex and inactive carazolol-bound 8,AR!’. Small rearrangements at 
the conserved PIF motif (P192>*°, 19539 and F243°* in the A,R; 
superscripts denote Ballesteros—-Weinstein numbering) are associated 
with the movement of TM6. A 4 A inward displacement of TM7 at the 
NPXXY motif (N2847?, P2857°° and Y2877) propagates to the base 
of helix 8, and is linked to an outward movement of ICL1. The partially 
formed ‘ionic lock; important in maintaining the ground state of the 


switch’ W247°* and residues within 4 A of ADO are labelled and shown 
as sticks. Red rectangles highlight rotamer changes upon activation. N, O 
and S atoms are coloured in blue, red and yellow, respectively. Dashed lines 
represent hydrogen bonds. g, Conserved class A GPCR motifs important 
for receptor activation (DRY motif, purple; NPXXY motif, blue; PIF motif, 
green). H8, helix 8. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


. @ 883k 85 Ese 
Ballesteros & wo bw Ww oO NO OO 
Weinstein 4 oO orn uno rad 
numbering = aaa aa 8&8 
AR 
Go, ; 
numbering 1 
c 88 
aah aia 
QO 
NO 
NN 
B,AR 


Ga, 


G.hns1.3 39 


Fig. 3 | Comparison of Ga; and Ga, interactions. a, Diagram of 

the contacts between Gajz and A;R. Receptor residues are numbered 
according to Ballesteros and Weinstein numbering*; G-protein residues 
are numbered according to the CGN scheme”’. Blue receptor residue 
numbers indicate conservation between A;R and B2AR. Length of orange 


receptor’’ and observed in the inactive A;R°, is broken, and R105* 
of the DRY motif extends towards TM7 to form a lid over the Gaz 
a5-helix. 

Another key feature of the inactive A;R is the presence of a large 
binding cavity that accommodates the orthosteric site®. Upon activa- 
tion, this wide cavity collapses owing to an approximately 4 A inward 
movement of the extracellular ends of TM1 and TM2 (Fig. 2b, d, e). 
Of note, ECL2, which adopts a distinct orientation almost perpendic- 
ular to the plane of the membrane, and contributes to the binding of 
allosteric modulators’, remains essentially unaltered (Fig. 2a). The col- 
lapse of the extracellular cavity is less pronounced when compared to 
the inactive A,R bound to the Aj-selective xanthine-based antagonist 
PSB36 (1-butyl-8-(hexahydro-2,5-methanopentalen-3a(1H)-yl)-3,7- 
dihydro-3-(3-hydroxypropyl)-1H-purine-2,6-dione)° (Extended Data 
Fig. 5), probably reflecting that the latter, reversibly binding, antago- 
nist is smaller than the bulky DU172. Nonetheless, a shrinkage of the 
orthosteric site upon activation is also observed in A7,R'3!, B,AR!0, 
Mb>R!>3 and pOR'*4 Interestingly, despite notable differences in the 
ligand-binding sites between the inactive A;R and A24R, a compari- 
son of the active states of the two subtypes reveals an almost perfect 
superimposition of their transmembrane domain regions (root mean 
squared deviation (r.m.s.d.) value of 1.03) (Extended Data Fig. 6a-c). 

The side-chain conformations in the orthosteric site are very similar 
between the active and inactive A;Rs. Subtle upward movements of the 
TM3 backbone (approximately 1.5 A), and TM7 towards TM6 (1 A), 
side-chain movements of W247°“S (‘toggle switch’) and H278”, and 
a change in rotamer conformations of V87>, T9136 and T2777” 
collectively serve to accommodate the ADO ribose moiety (Fig. 2f). A 
role for the two threonine residues is consistent with previous studies, 
whereas mutation of V87A was suggested to have no effect””°. We 
thus re-investigated the role of V87A and found that, in our hands, the 
mutation significantly reduced both the affinity and efficacy of NECA 
(Extended Data Fig. 2d-g). Interestingly, the related (but G,-preferring) 
A2zaK has previously been solved not only in an active state, bound to 
NECA and a mG, protein, but also an ‘intermediate’ state, bound to 
either NECA or ADO but in the absence of transducer!3!”, Further 
comparison of our active A,R structure with either of the A24R active 
or intermediate agonist-bound states revealed similar shifts in TM3 and 
TM7, and similar changes in W246°48, H278743, T8836 and V843:32 


Salt bridge 

—— Hydrogen bond 

soees Hydrogen bond through backbone 
Van der Waals bond 


@ Hydrophobic 

@ Negatively charged 
©. Polar 

@ Positively charged 


G.H5.22 354 degree of conservation within G,,, family 
G.H5.25 393 conserved between G, and G,, families 
275 6.37 conserved between A,R and B,AR 


shading of Ga, numbers illustrates degree of conservation within the 
Gayo family. b, View of key interactions between A,R (blue) and DNGaiz 
(pink). Hydrogen bonds and salt bridges are depicted as dashed lines 
coloured as in panel a. c, Diagram of the contacts between Ga, and B,AR 
(based on PDB code 3SN6), coloured as in panel a. 


after agonist binding’*?"’ (Extended Data Fig. 6). These complemen- 


tary rearrangements result in nearly identical orthosteric sites between 
agonist-bound A;R and AjaR, suggesting that the propagation and 
selection of coupling partner must involve conformational changes 
downstream of this binding site. Other notable interactions between 
orthosteric site residues and ADO in the AjR include 1-stacking with 
F171 C2, hydrogen bonding with N254555, H2787-3 and E1722!2, 
and Van der Waals interactions with M180°38 and L250°>! (Fig. 2f, 
Extended Data Fig. 5f), which are all consistent with previous muta- 
tional and computational studies of agonist interactions with the 
A, R2528, 


Structure, coupling and selectivity of the Gj heterotrimer 
Despite the recent solution of a number of class A GPCR structures in 
‘active-like’ states stabilized by nanobodies or mG,, there still remains 
only a single class A GPCR structure bound to a heterotrimeric G pro- 
tein—the 8,AR-G, complex”. As also observed with that complex, 
interactions between G; and the A,R are extensive, with a total buried 
surface area of 1,964 A? (958 A? at the A,R and 1,006 A? at the Gay). 
Another similarity between the two GPCR-G-protein complexes is 
that the G protein interacts primarily via the Ga Ras-like domain. An 
alignment of both receptors (Extended Data Fig. 7a) and G proteins 
(Extended Data Fig. 7b), followed by a comparison of key interaction 
points between the a5-helix of each Ga subunit with its respective 
receptor, highlighted two distinct clusters (Fig. 3; Extended Data 
Fig. 4b, c). The first region is common to both Ga, and Gaj, encom- 
passing approximately residues 12-20 of each a5-helix (H5.8-H5.20, 
common Ga numbering (CGN) system’), possibly highlighting the 
broad importance of this region in providing strong contacts for recep- 
tor binding. However, for the 8,AR-G, complex, there are more inter- 
actions within this region of «5 with TM3 and TM5 of the receptor 
that consist of both polar and non-polar contacts. By contrast, there are 
fewer interactions for the A;R-G; complex and only D342 (G.H5.13) of 
the Go; subunit forms a hydrogen bond with Q210°°8 and a salt bridge 
with K213°7!, with the rest of the cluster making nonpolar interac- 
tions. The second key interaction region comprises the five C-terminal 
a5-helix amino acid residues that provide more receptor interactions 
for the Ga; than the Gas helix. These last five amino acids provide 
the strongest set of interactions for A; R-Gy (Fig. 3a, b), including inter- 


NATUR E|www.nature.com/nature 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


TMS 


Fig. 4 | Orientation of Ga subunits varies between GPCR-G-protein 
complexes. a, Cytoplasmic view shows «5-helix orientation difference 
between G; and G,. Interacting residues of the «5-helix (h5), TM7 and 
helix 8 are shown as sticks. The salt bridge is shown as a dashed line. 

b, Cytoplasmic view reveals a similar orientation of the Gay, «5-helix and 
a synthetic peptide derived from the Ga subunit of transducin (GaCT, 
purple; PDB code 3PQR) relative to active A;R and active Rho (in grey), 
respectively. The small inward movement of TM6 probably reflects a 

lack of additional contacts for Rho beyond the 11-mer GaCT fragment, 
whereas the A;R engages the entire Gj, heterotrimer. 


actions with TM2, TM3, TM5-TM7 and helix 8. The carboxyl group 
of F355 (GH5.26) forms a salt bridge with K228°”? (usually K/R in 
G;-coupled receptors”). L354 (GH5.25) extends from Ga into a hydro- 
phobic pocket lined with A;R 1232°%3, L236°°”, V203°°! and 1207. 
Residue D351 (GH5.22, D in Gay/2 and E in Gays) forms a salt bridge 
with R108? and K2945“. Notably, studies of chimeric G proteins have 
suggested that these five C-terminal «5-helix residues are generally 
sufficient for selective GPCR-G-protein coupling”, although other 
residues are also involved*!. Nonetheless, it is evident that this region 
makes a very different set of interactions with the A;R compared to 
the same region in the 82AR-Gs complex a5-helix, the latter charac- 
terized by weaker Van der Waals interactions (Fig. 3c). In addition to 
the «5-helical residues, D316 of the a4-86-loop of Gai stabilizes the 
interaction with the A,R via a salt bridge with K224°”> and K228°79 
(Fig. 3a, c). 

Although both Ga; and Ga, interact with TM3, TMS, ICL1 and TM6 
of their respective GPCRs (Extended Data Fig. 8), Ga; makes addi- 
tional contacts with the cytoplasmic end of TM7 (R2917°°) and helix 
8 (1292°” and R294’), which are absent in the 8, AR-G, structure 
(Fig. 4a). In addition, extra density near the unmodelled flexible ICL3 
of the A,R suggests the potential for additional interactions between the 
receptor and the Ga; 86-sheet. Notably, A;R L113!“ does not contrib- 
ute substantially to Ga; binding, in contrast to the equivalent residue, 
F139!Cl2, in 8.AR-G,. Indeed, L113!“ forms only a single Van der 
Waals bond with 1344 (GH5.15), whereas B,AR F139'C binds in an 
extensive hydrophobic pocket formed by the Ga, a5-helix, 82-83-loop 
and 6 1-strands (Fig. 3a—c). As in the 82AR-G, complex, ICL] and helix 
8 of the A,R are in close proximity to the Gg subunit (Extended Data 
Fig. 9a). The presence of multiple polar and charged amino acids at 
this interface suggests potential interactions; however, the side-chain 
density was insufficient for confident modelling. 

There have been numerous studies identifying a role for the Ga 
subunit «5-helix C terminus in determining G-protein selectivity for 
GPCRs*!, but less is known about receptor determinants that gov- 
ern GPCR selectivity for G proteins”. A receptor-based alignment 
of the A,R-Gj2 and 8,AR-G, complexes revealed that each receptor 
engages its G protein in a different orientation (Fig. 4a; Extended Data 
Fig. 8); compared to B.AR-G,, G; in the A;R-Gj2 complex is trans- 
lated by approximately 4.5 A relative to the receptor along the TM5- 
TMI vector. This translation arises from a difference in the position 
of the «5-helix relative to the receptor, leading to a movement of the 
rest of Ga; coupled with movements of GB and Gy subunits. There 
are two likely mechanisms contributing to the observed translation: 
conformational differences in the Ga subunits and distinct receptor 
conformations. With respect to the first mechanism, a comparison of 
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Fig. 5 | Schematic summarizing the key translational and rotational 
movements contributing to differences in G-protein coupling between 
A,R-G; with B,AR-G,. The receptor outline is that of the A;Rasa 
reference. Key receptor transmembrane domains and the Ga protein 
C-terminal «5-helices are shown as cylindrical helixes (82AR, green; A,R, 
blue); the rest of the Ga proteins are represented as contoured surfaces 
with Ga in pink and Ga, in gold. 


the nucleotide-free, receptor-bound conformations of the G; and G, 
a-subunits revealed that while, overall, they adopt a similar backbone 
conformation (r.m.s.d. value of 1.34 A), there are differences in the 
flexible loop regions and the «5-helix conformation (Extended Data 
Fig. 9b), which arise as a consequence of the last two turns of the Ga, 
a5-helix bending towards the a4-86-loop. This results in a 3.5 A shift 
in positioning of the G.H5.23 C, residue (Extended Data Fig. 9c). 
However, the bend in the Ga, «5-helix was not observed in cryo-EM 
structures’~°. The second mechanism that may lead to distinct orien- 
tations of Gai relative to Ga, in complex with their receptors is related 
to conformational differences in the receptors themselves, particularly 
TM6. The AjR structure displays a smaller outward movement of TM6 
relative to 8,AR-G,'° (10.5 A versus 14 A, respectively), with the a5-he- 
lix closer to TM7 and helix 8 that results in a difference in receptor 
interaction angle (approximately 18°) (Fig. 4a). This translation brings 
D351 (G.H5.22) within a salt-bridge distance of K294°, and both 
C352 (G.H5.23) and G353 (G.H5.24) within Van der Waals radius of 
R2917°6 and 1292°4”. A comparison of A;R-Gi with metarhodopsin II 
(Rho), the active state of rhodopsin, bound to a C-terminal fragment 
of transducin (GaCT) reveals a similar location of the two domains 
for Goy and transducin (Go) (Fig. 4b), consistent with the fact that 
Go, is most closely related to G; proteins*”. The key global conforma- 
tional differences between the 82AR-G, and A;R-Gj2 complexes are 
summarized in Fig. 5. Although it remains to be determined whether 
G-protein selectivity is mediated predominantly by specific receptor 
residue contacts as opposed to global conformational rearrangements 
shaping intracellular ‘pocket complementarity”, a key role for the 
degree of TM6 tilt is a possibility. However, this should be interpreted 
with caution because the distinct differences in the movement of TM6 
between the 3,AR-G, and A,R-Gj, structures and translation of the 
G protein could actually manifest as a result of three different underlying 
mechanisms: (i) true differences in receptor activation independent of 
G-protein coupling; this is not readily apparent upon comparison of 
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the A;R and A>,R structures (Extended Data Fig. 6); (ii) the signalling 
state of the solved receptor-G-protein complex being influenced by the 
nanobody (Nb35) present in the 8, AR-G, or the dominant-negative 
mutations present in the Gajz protein in our A;R-Gj structure; or (iii) 
a difference in the general activation mechanism for Gi.-coupled com- 
pared to G,-coupled GPCRs. This will ultimately require the solution 
of more GPCR-G-protein structures in as ‘native’ a state as possible. 
Nonetheless, a role for the degree of TM6 tilt in G-protein selectivity 
is consistent with previous molecular dynamics studies on B,AR 
complexes with C-terminal Gas- or Gaj-derived peptides** and this 
is supported in the available G protein complexed GPCR structures. 

In conclusion, this study presents the first, to our knowledge, active- 
state class A cryo-EM structure, highlighting the utility of this tech- 
nique to determine new GPCR structural information. The structure 
of the ADO-A,R-Gy heterotrimer provides important insights into the 
activation mechanism of the A;R in response to its endogenous agonist. 
It also allows the first comparison between different subtypes of hetero- 
trimeric G proteins bound to activated GPCRs, which may be pivotal 
in understanding pleiotropic coupling of these receptors. Finally, the 
findings have broad implications for understanding the structural basis 
underlying GPCR-G-protein selectivity, and can facilitate rational, 
structure-based, approaches for the design of subtype-selective AiR 
ligands as new therapeutic agents. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and investigators were not blinded to allocation during 
experiments and outcome assessment. 

Constructs. Wild-type human A;R was modified to include an N-terminal Flag 
tag epitope and a C-terminal 8 x histidine tag; both tags are removable by 3C pro- 
tease cleavage (Extended Data Fig. 1a). A DNGaj, construct was generated by site- 
directed mutagenesis to incorporate mutations that alter nucleotide binding (S47N, 
G204A and A327S) and a mutation (E246A) that improves the dominant-negative 
effect by weakening a salt bridge that helps to stabilize the nucleotide-bound con- 
formation®*>-**, These constructs were generated in insect cell expression vectors. 
Insect cell expression. A;R, human DNGoja, Hiss-tagged human G8; and Gy 
were expressed in HighFive insect cells (Expression Systems) using baculovirus. 
Cell cultures were grown in ESF 921 serum-free media (Expression System) to a 
density of 4 million cells per ml and then infected with either A;R baculovirus or 
both Gaz and GB). baculovirus, at a ratio of 1:1. Cultures were grown at 27°C 
and collected by centrifugation 60 h after infection. Cells were snap frozen and 
stored at —80°C for later use. 

Complex purification. Cells from either A,R or heterotrimeric G; expression were 
solubilized separately in 20 mM HEPES pH 7.4, 100 mM NaCl, 5 mM MgCh, 5 mM 
CaCh, 0.5% (w/v) lauryl maltose neopentyl glycol (LMNG, Anatrace), 0.01% (w/v) 
cholesteryl hemisuccinate (CHS, Anatrace) supplemented with cOmplete Protease 
Inhibitor Cocktail tables (Roche). Complex formation was initiated combining 
solubilized A,R and heterotrimeric G; and by addition of 1 mM ADO (Sigma) and 
apyrase (25 mU ml, NEB); followed by 2 h incubation at 4°C. Insoluble material 
was removed by centrifugation at 30,000g for 30 min and the solubilized complex 
was immobilized by batch binding to M1 anti-Flag affinity resin in the presence of 
5 mM CaCl). The resin was packed into a glass column and washed with 20 column 
volumes of 20 mM HEPES pH 7.4, 100 mM NaCl, 5 mM MgCh, 5 mM CaCh, 
lmM ADO, 0.01% (w/v) LMNG and 0.001% (w/v) CHS before bound material 
was eluted in buffer containing 10 mM EGTA and 0.1 mg ml! Flag peptide. 
The complex was concentrated using an Amicon Ultra Centrifugal Filter (MWCO 
100 kDa) and subjected to size-exclusion chromatography on a Superdex 200 
Increase 10/300 column (GE Healthcare) pre-equilibrated with 20 mM HEPES 
pH 7.4, 100 mM NaCl, 5 mM MgCh, 1 mM ADO, 0.01% (w/v) and 0.001% (w/v) 
CHS to separate complex from contaminants. Eluted fractions consisting of 
receptor and G-protein complex were pooled and concentrated. Final yield of 
purified complex was approximately 0.2 mg 17! insect cell culture. 

SDS-PAGE and western blot analysis. Samples collected from each purification 
step were analysed by SDS-PAGE and western blot. For SDS-PAGE, precast gra- 
dient TGX gels (Bio-Rad) were used. Gels were either stained by Instant Blue 
(Expedeon) or immediately transferred to PVDF membrane (Bio-Rad) at 100 V 
for 1 h. The proteins on the PVDF membrane were probed first with the primary 
mouse anti-His antibody (34660, QIAGEN), followed by washing and incubation 
with secondary anti-mouse antibody (680RD). The membranes were again washed 
and incubated with an AlexaFluor488 conjugated Gajz mouse monoclonal anti- 
body (sc13534, Santa Cruz) against Gaz. Bands were imaged using a Typhoon 
multimode imager (GE Healthcare Life Sciences). 

Preparation of vitrified specimen. For cryo-EM, purified A;R-G;2 complex was 
diluted to 1.3 mg ml! with 20 mM HEPES 7.4, 100 mM NaCl, 5 mM MgCh, 1 mM 
ADO. EM grids (Quantifoil, 200 mesh copper R1.2/1.3) were glow discharged for 
30 s using Harrick plasma cleaner (Harrick). 4 11 sample was applied on the grid 
in the Vitrobot chamber (FEI Vitrobot Mark IV). The chamber of Vitrobot was 
set to 100% humidity at 4°C. The sample was blotted for 4.5 s with blot force of 20 
and then plunged into propane-ethane mixture (37% ethane and 63% propane). 
Data acquisition. Datasets were collected on an FEI Titan Krios microscope oper- 
ated at 300 kV (FEI) equipped with a Gatan Quantum energy filter, a Gatan K2 
summit direct electron camera (Gatan) and a Volta phase-plate**” (FEI). Movies 
were taken in EFTEM nanoprobe mode, with 50 j1m C2 aperture, at a calibrated 
magnification of 47,170 corresponding to a magnified pixel size of 1.06 A. Each 
movie comprises 50 sub frames with a total dose of 50 e~ A~?, exposure time was 
12.5 s with the dose rate of 4.5 e~ pixels~! s~! on the detector. Data acquisition was 
done using SerialEM software at —500 nm defocus”. 

Data processing. A total of 3,220 movies were collected and subjected for motion 
correction using motioncor2“4. The movies were collected after installation of a 
new VPP. Each position on the VPP was used for 24 h. CTF estimation was done 
using Gctf software on non-dose-weighted micrographs. The particles were 
picked using gautomatch (developed by K. Zhang, MRC Laboratory of Molecular 
Biology, Cambridge, UK, http://www.mrc-lmb.cam.ac.uk/kzhang/Gautomatch/). 
An initial model was made using EMAN2* based on automatically picked few 
micrographs and using the common-line approach. The particles were extracted 
in RELION 2.01b1” using a box size of 180 pixels. Next, 831,602 picked particles 
were subjected to two rounds of 3D classification with three classes. After selecting 
the best-looking class with 263,321 particles, 3D auto-refinement was done in 


RELION 2.01b1. The final map was sharpened with a B-factor of —196 A”. Local 
resolution was determined using the internal local resolution procedure in Relion, 
using half-reconstruction as input maps. 

Model building. The initial receptor model for A,R-Gy, complex refinement was 
a chimaera between an active state A;R homology model, made using the SWISS- 
MODEL server**, based on the A2,—-mG, structure (PDB code 5G53) and the inactive 
AR structure (PDB code 5UEN). The initial model for the receptor-bound Gajz 
subunit was a chimaera between a Gay, homology model, based on the 8,AR-bound 
heterotrimeric G, structure (PDB code 3SN6), and a homology model based on 
the GDP-bound dominant-negative G; heterotrimer structure (PDB code 5TDH). 
G8 and Gy models were taken from the 8,AR-G, structure. Refinement, using the 
phenix.real_space_refine module in PHENIX™, was iterated with manual model 
adjustment and rebuilding in COOT™. Restraints for the agonist ADO were gen- 
erated using the GRADE server, https://www.globalphasing.com (v.12.13). Model 
validation was performed in MolProbity”!. For validation of model overfitting to 
the cryo-EM maps we displaced the model atoms (the same model used for the 
final refinement step) up to 0.5 A using ‘Shake’ in the CCPEM suite”. Subsequently, 
refinement was performed against the sharpened half-map 1 (HM1) (sharpened in 
Refmac using the full map as the reference**) using phenix.real_space_refine mod- 
ule in PHENIX” with the same parameters used in the final refinement step. Fourier 
shell correlation (FSC) curves were generated in the MTRIAGE module™ in Phenix 
for: 1) shaken pdb refined against HM1 versus HM1; 2) shaken pdb refined against 
HM1 versus HM2 (also sharpened in Refmac using the full map as the reference); 
3) the final pdb versus the full sharpened map (also sharpened in Refmac using the 
full map as the reference). Figures were prepared in USCF Chimera®? or PYMOL 
Molecular Graphics System, Version 2.0 (Schrédinger, LLC). 

Insect cell membrane preparations for [*H]DPCPX and [*°S]GTP*S binding. 
The A,R complex with wild-type heterotrimeric Giz (A, R, wild-type Gai2, GByy2), 
DNGj2 (AiR, DNGoj2, G12) or A;R alone were expressed in HighFive insect 
cells (Expression systems). Cells were collected approximately 48 h after the viral 
infection. For crude membrane preparations, cells were resuspended in membrane 
buffer (20 mM HEPES 7.4, 100 mM NaCl, 5 mM MgCh, 1 mM EDTA, with 
protease inhibitors and benzonase), dounced 20 times, followed by centrifugation 
(5 min, 350g, 4°C). The pellet was again resuspended in membrane buffer, dounced 
and clarified by centrifugation at low g. Membranes were pelleted by centrifugation 
(1h, 30,000g, 4°C), resuspended in the membrane buffer and sonicated. The pro- 
tein concentration was determined using Bradford reagent (Bio-Rad). 
Radioligand competition binding experiments in membranes from insect cells 
expressing the A,R. [7H]DPCPX binding was performed in 20 mM HEPES pH 
7.4, 100 mM NaCl, 5 mM MgCh, 0.1% BSA. Membranes were incubated for 1 h 
at 37°C with 1 nM of [>H]DPCPX and increasing concentrations of either ADO 
or cold DPCPX. Membranes were collected on UniFilter GF/C (Whatman) plates 
using Filtermate 196 harvester (Packard), extensively washed with ice-cold NaCl, 
dried and dissolved in 50 yl MicroScint-O scintillation cocktail (Packard) and 
counted using a MicroBeta LumiJET counter (PerkinElmer). Non-specific binding 
was determined in the presence of 10 1M SLV320. Competition binding curves 
between [7H]DPCPX and either unlabelled DPCPX or ADO were fitted to one- or 
two-site competition binding equations in Prism 6.0 (GraphPad). 

[°°S]GTP-~S binding in membranes from insect cells expressing the A,R. 
Measurement of [*°S]GTP'S incorporation to activated G proteins was performed 
in 20 mM HEPES pH 7.4, 100 mM NaCl, 10 mM MgCl, 1 mM EDTA, 0.1% BSA, 
30 pg mI! saponin. First, membranes (10 jig per sample) were pre-incubated with 
1M GDP and increasing concentrations of ADO for 45 min at 22°C. Reactions 
were then started by the addition of [*°S]GTPS and ATP to final concentrations 
of 300 pM and 50 \\M, respectively. After 1 h incubation of 30°C, the reaction 
was terminated by harvesting the membranes on Whatman UniFilter GF/C plates 
using Filtermate 196 harvester (Packard). Membranes were extensively washed 
with ice-cold 50 mM Tris pH 7.6, 10 mM MgCls, 100 mM NaCl, dried, dissolve in 
50 pl MicroScint-O scintillation cocktail (Packard) and counted using a MicroBeta 
LumiJET counter (PerkinElmer). 

Radioligand competition binding and cAMP accumulation assays in whole cells 
expressing the wild-type or mutant A,R. Generation of Chinese hamster ovary 
(CHO) FIpIN cell lines, stably expressing a 3x haemagglutinin (HA)-tagged wild- 
type human A,R or alanine substitution mutations (V87A; N159A), [>H]DPCPX 
competition binding, and inhibition of forskolin-stimulated cAMP accumulation 
by the agonist, NECA, were all performed as described previously**. Generation of 
the N159A mutation has also been previously described”*. For V87A, the following 
oligonucleotides were used to introduce the valine to alanine substitution at posi- 
tion 87; forward: CATGGTTGCCTGTCCGGCCCTCATCCTCACCCAG, reverse: 
CTGGGTGAGGATGAGGGCCGGACAGGCAACCATG. All other details were 
as previously described”*. Cells were routinely tested and confirmed to be free from 
mycoplasma contamination. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 
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Extended Data Fig. 1 | See next page for caption. 
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Extended Data Fig. 1 | Expression and purification of the ADO-A,R- 
Gj. complex. a, Schematic of the haemagglutinin (HA) and Flag-tagged 
M4R-3C-A1R-3C-8 x His construct. The most conserved residue for 

class A GPCRs (X.50 class A numbering™) are highlighted for each 
transmembrane domain in red. b, Purification step flowchart for the 
A,R-Gi2 complex. c, SDS-PAGE/western blot of samples obtained at 
various stages of A;R-Gj2 purification. A,R and the Gj heterotrimer were 
expressed separately in insect cell membranes. Addition of ADO initiated 
complex formation, which was solubilized by detergent. Solubilized AiR 
and A,R -G,2 complex were immobilized on Flag antibody resin. 
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Flag-eluted fractions were purified by size-exclusion chromatography 
(SEC). An anti-His antibody was used to detect Flag~A;R-His and G8-His 
(red) and an anti-G;, antibody was used to detect Gajz (green). d, SDS- 
PAGE/Coomassie blue stain of the purified complex concentrated from 
the Superdex 200 Increase 10/30 column. e, Left, representative elution 
profile of Flag-purified complex on Superdex 200 Increase 10/30 SEC. 
Right, SEC fractions containing A; R-G;2 complex (within dashed lines) 
were pooled, concentrated and analysed by re-running on Superdex 200 
Increase 10/30 column. All images and SEC profiles are representative of 
more than three independent experiments. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


oo 
oO 


% [PH]IDPCPX specific binding 
(o>) 
oO 


Log[DPCPX] M 


120 
~. 100 


WT Gi2) 


38S-GTPyS binding (% max 


HA-FLAG-3C-A1R-8xHIS 


Log [ADO] M 


@ HA-FLAG-3C-A,R-3C-8xHIS 


@ HA-FLAG-3C-A,R-3C-8xHIS : WT Gi2 


@ HA-FLAG-3C-A,R-3C-8xHIS : DN Gi2 


Affinity 


WT V87A 


Extended Data Fig. 2 | See next page for caption. 


RRKK 


N159A 


=< =k 
yO f+ DBD wo DON 
(oe © © ee © ee © 


% [PH]IDPCPX specific binding 
oO 


Log[ADO] M 


2 
= 120 
£ 
S 100 
2 
> «80 
®o 
2 60 
x< 
a el 
dl oo| © V8IA 
ja) 
So ean 
x2 ee eee ees 
10 9 2 7 6 6 «<4 
Log[NECA] M 
120 
“a @ WT 
se 100 
c @ V87A 
5 80 
= @ N159A 
2 60 
=< 
= 40 
a 
= 20 
{S) 
0. COrerrrrere rr errr ey rer 
a | 
10 9 8 7 6 -5 4-4 
Log[NECA] M 
Efficacy 
3 
< 
O2 
LU 
a1 
= 
a” 
_j-1 
RRKK 
-2 
WT V87A N159A 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


Extended Data Fig. 2 | Pharmacology of the A,R construct and 
rationally chosen mutations. a, b, Competition between the antagonist 
[SH]DPCPX and either unlabelled DPCPX (a) or ADO (b), in membranes 
expressing HA-Flag-3C-A1R-3C-8 x His construct in the absence or 
presence of wild-type (WT) Giz heterotrimer or dominant-negative (DN) 
Giz heterotrimer. Data are normalized to [7H]DPCPX binding in the 
absence of unlabelled competitor, with nonspecific binding determined in 
the presence of 1 11M of the antagonist, SLV320. c, ADO-mediated binding 
of [*°S]GTPS as a measure of G-protein activation by the HA-Flag—3C- 
A1R-3C-8 x His construct in High Five cells expressing receptor alone, or 
together with either wild-type or dominant-negative Gj. heterotrimer. 
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d, e, (3H]DPCPX competition assays (d) or inhibition of forskolin- 
stimulated cAMP accumulation (e), at the wild-type human A,R or two 
key alanine substitution mutations stably expressed in CHOFIpIn cells. 

f, Changes in agonist (NECA) affinity (K;) from the experiments shown 
in d. g, Changes in NECA signalling efficacy corrected for receptor 
expression (7,), determined from the experiments shown in e. Parameter 
estimates are the mean + s.e.m. determined from 3 (a-c) or 6-48 (d-g) 
independent experiments performed in duplicate. ****P < 0.0001 
(compared with wild type; one-way analysis of variance (ANOVA), 
Dunnett's post hoc test). Data for wild-type and N1594A are replotted from 
Nguyen et al”®. 
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Extended Data Fig. 3 | Cryo-EM of the ADO-A,R-G). complex. 
a, Representative VPP cryo-EM micrograph (of 3,220 recordings) of 
the ADO-A,R-G;, complex. b, Reference-free 2D class averages of the 
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complex in LMNG and CHS detergent micelles. c, Gold-standard Fourier 
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maps for overfitting validation (see Methods). 
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Extended Data Fig. 4 | Atomic resolution model of A;Rtransmembrane at 0.06. b—d, A;R residues (b, c,) and Gay «5-helix residues (d). The 
domains, the Ga protein «5-helix, ADO, and representative regions molecular model is shown in stick representation and the cryo-EM map in 
of GS and G7 in the cryo-EM density map. a, The molecular model is mesh contoured at 0.06. 

shown in stick representation and the cryo-EM map in mesh contoured 
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M6 Active A1-AR*-ADO 
Inactive A1-AR*PSB36 


Extended Data Fig. 5 | Comparison of active and alternative inactive (purple ball and sticks). ‘Toggle switch’ W247°* and residues within 4 A 
AR (PDB code 5NS2) structure. a—c, Side (a), extracellular (b) and of ADO are labelled and shown as sticks. Red rectangles highlight rotamer 
cytoplasmic (c) view of the ADN-A,R-Gy structure (blue) compared changes upon receptor activation. N, O and S atoms are coloured in blue, 
to the inactive PSB36-bound AjR (grey). d, e, Active ADO-AjR (d) and red and yellow, respectively. Dashed lines represent hydrogen bonds. 
inactive PSB36-A,R (e) receptor surfaces sliced to show binding site g, GPCR motifs important for receptor activation (DRY motif, purple; 


cavity. f, Orthosteric binding site of the active A;R-G;2 complex with ADO NPXXY motif, blue; PIF motif, green). 
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A1-AR*-ADO 
A2A-AR*NECA (PDB 5G53) 


A1-AR:ADO 
A2A-AR*NECA (2YDV) 


Extended Data Fig. 6 | Comparison of active A,R with active A2,R (PDB 
code 5G53) or agonist-bound ‘intermediate’ state A24R (PDB code 
2YDV). a-j, Side views (a, f), extracellular views (b, g) and cytoplasmic 
views (c, h) of the active ADO-A,R-Gj, structure (blue) compared to the 
active NECA-A>,,4R-mini-G, structure (a-e) or ‘intermediate NECA- 
AzaR structure (orange) (f-j). d, i, Orthosteric binding site of the active 
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A,R-Gi2 complex with ADO (purple ball and sticks) or Az,R with NECA 
(orange ball and sticks). “Toggle switch’ residue W°** and residues within 
4 A of ADO are labelled and shown as sticks. N, O and S$ atoms are coloured 
in blue, red and yellow, respectively. Dashed lines represent hydrogen 
bonds. e, j, Conserved class A GPCR motifs important for receptor 
activation (DRY motif, purple; NPXXY motif, blue; PIF motif, green). 
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negative Gaj2 with Ga,. a, AR alignment with BAR. b, Ga, alignment B2,R-Ga, structure. Dominant-negative Gaj2 point mutations are 
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shown in red. Grey bars indicate the positions of the a-helices in the 
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Extended Data Fig. 8 | Comparison of the A;R-Gj2 with B,AR-G, 
structures. Overlay of A;R-Gi2 with B,AR-G, (PDB code 3SN6) 
complexes. (A,R-Gj, is coloured as in Fig. 1; 82AR is in green, Ga, is in 
gold, G@ is in light cyan, G7 is in light purple). 
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Extended Data Fig. 9 | View of key residues at the interface of A,R and 
G@, and Ga conformations. a, AR is in blue and Gf is in dark cyan. 

b, c, Different views comparing DNGajz and Ga, from AR and B,AR 
(PDB code 3SN6) receptor-bound structures (DNGajz, pink; Ga,, gold). 


DNGai2 
Gas (PDB 3SN6) 


Spheres indicate the positions of the dominant-negative mutations on 
the DNGa with N, O and C atoms coloured in blue, red and pink, 
respectively. The «5-helix bend and loops that are the most different 
between Gay and Ga, are indicated. 
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Extended Data Table 1 | Cryo-EM data collection, refinement and validation statistics 


A1-AR:Giz 
(EMDB- EMD-7835) (PDB 6D9H) 


Data collection and 


processing 

Magnification 47170 
Voltage (kV) 300 
Electron exposure (e—/A?) 50e/A? 
Defocus range (nm) -500 
Pixel size (A) 1.06 
Symmetry imposed C1 
Initial particle images 831,602 
(no.) 

Final particle images 263,321 
(no.) 

Map resolution (A) 3.6 
FSC threshold 0.143 
Map resolution range (A) 3.4 - 4.2 

Refinement 

Initial model used (PDB 5UEN, 5G53, 3SN6, 5TDH 
code) 

Model resolution (A) 

FSC threshold 0.143 
Model resolution range 3.6 
(A) 

Map sharpening B factor -196 
(A?) 
Model composition 

Protein residues 893 

Ligand (ADO) 1 
B factors (A?) 

Protein residues 21-125 (avr. 54) 

Ligand (ADO) 44.5 
R.m.s. deviations 

Bond lengths (A) 0.007 

Bond angles (°) 12 
Validation 

MolProbity score 1.42 

Clashscore 4.18 

Poor rotamers (%) 0 
Ramachandran plot 

Favored (%) 96.6 

Allowed (%) 3.4 


Disallowed (%) 0 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


LETTER 


https://doi.org/10.1038/s41586-018-0246-4 


Macromolecular organic compounds from the 


depths of Enceladus 


Frank Postberg!*3-3*, Nozair Khawaja>!’, Bernd Abel*, Gael Choblet®, Christopher R. Glein®, Murthy S. Gudipati’, 
Bryana L. Henderson’, Hsiang-Wen Hsu®, Sascha Kempf®, Fabian Klenner!, Georg Moragas-Klostermeyer’, Brian Magee®®, 
Lenz Nolle!, Mark Perry", René Reviol', Jtirgen Schmidt", Ralf Srama’, Ferdinand Stolz*+", Gabriel Tobie°, Mario Trieloff!? & 


J. Hunter Waite® 


Saturn’s moon Enceladus harbours a global water ocean!, which 
lies under an ice crust and above a rocky core”. Through warm 
cracks in the crust? a cryo-volcanic plume ejects ice grains and 
vapour into space*”’ that contain materials originating from the 
ocean®?, Hydrothermal activity is suspected to occur deep inside 
the porous core!*"!, powered by tidal dissipation’. So far, only 
simple organic compounds with molecular masses mostly below 
50 atomic mass units have been observed in plume material®'*". 
Here we report observations of emitted ice grains containing 
concentrated and complex macromolecular organic material with 
molecular masses above 200 atomic mass units. The data constrain 
the macromolecular structure of organics detected in the ice grains 
and suggest the presence of a thin organic-rich film on top of the 
oceanic water table, where organic nucleation cores generated by 
the bursting of bubbles allow the probing of Enceladus’ organic 
inventory in enhanced concentrations. 

Two mass spectrometers onboard the Cassini spacecraft, the Cosmic 
Dust Analyzer (CDA) and the Ion and Neutral Mass Spectrometer 
(INMS), performed compositional in situ measurements of material 
emerging from the subsurface of Enceladus. These measurements were 
made inside both the plume and Saturn’ E ring, which is formed by ice 
grains escaping Enceladus’ gravity’®. 

The CDA records time-of-flight (TOF) mass spectra of cations gen- 
erated by high-velocity impacts of individual grains onto a rhodium 
target, with a mass resolution of m/Am & 20-50!*!”, Previous CDA 
measurements showed that about 25% of the ice grain spectra of the 
E ring, the so-called type-2 spectra, exhibit the presence of organic 
material®?!*, A subgroup (about 3%) of type-2 spectra (see Methods 
sections ‘Dataset’ and ‘Relative frequency of HMOC-type grains 
depends on impact speed and distance to Enceladus orbit’; Extended 
Data Table 1) is characterized by a sequence of repetitive peaks beyond 
80 u (where u is the unified atomic mass unit), usually separated by 
mass intervals of 12 u-13 u (Fig. 1). In most cases, this sequence 
extends to the maximum mass nominally covered by the CDA, about 
200 u. The broad and irregular shape of the peaks indicates that they are 
composed of multiple unresolved overlapping mass lines (Fig. la and 
Extended Data Fig. 1). The mass intervals between the peaks suggest 
organic species with an increasing number of carbon atoms (C; to Cj), 
which we refer to as high-mass organic cations (HMOCs). While an 
interval of 14 u would indicate the addition of a saturated CH2 group 
to an organic ‘backbone; the actual average mass difference of 12.5 u 
indicates the presence of predominately unsaturated carbon atoms. 

In principle, each HMOC peak could be derived from a different 
parent molecule. However, a monotonic decrease without major inten- 
sity variations from 77 u to 191 u (Fig. 1a, Extended Data Fig. 1) rather 


indicate fragments from higher-mass parent molecules (outside the 
CDAs nominal mass range) and not a conglomerate of several mole- 
cules with masses below 200 u. This interpretation is consistent with the 
observation that the detection probability increases with impact speed 
(see Methods, ‘Relative frequency of HMOC-type grains depends on 
impact speed and distance to Enceladus orbit’; Extended Data Table 1), 
increasing the available energy for fragmentation and ionization. A 
similar fragmentation pattern has been observed in impact experiments 
with polymers'*"? containing aromatic subunits (Extended Data Fig. 2; 
see Methods, ‘Inferring the origin of HMOC peaks in CDA spectra). 

In addition to the nominal mass spectrum (1 u to ~200 u), the CDA 
records extended TOF spectra of the same individual grains up to about 
8,000 u with resolution and sensitivity uniformly reduced by a factor of 
ten. These extended spectra exhibit abundant macromolecular cations 
with masses in excess of 200 u in most HMOC-type spectra (Extended 
Data Fig. 5). This further supports the interpretation that fragmentation 
of macromolecular parent molecules or networks creates the HMOCs. 

The HMOC sequence always appears together with intense non-wa- 
ter signatures below 80 u (Fig. 1), offering further compositional and 
structural constraints. Although there is some variation (Extended Data 
Figs. 1, 4), non-water peaks are arranged in six groups at 15 u, 27-31 u, 
38-45 u, 51-58 u, 63-67 u, and 77-79 u and, like the HMOC pattern, 
indicate a sequence of organic species with increasing number of car- 
bon atoms. Organic mass lines below 45 u appear preferentially at odd 
masses (Fig. 1b), indicating a typical cationic hydrocarbon fragmenta- 
tion fingerprint. However, from stoichiometry, the mass lines at 30 u, 
31 u, 44 u and 45 u cannot be pure hydrocarbon cations and indicate 
O- or N-bearing cations from hydroxyl (CH,OH*, CH3-CH-OH*), 
ethoxy or carbonyl functional groups or nitrogen-bearing ions (for 
example, CH >NH3 and CH 3-CH-NH; ) (Fig. 1b, Extended Data 
Fig. 4). 

A prominent peak preceding the HMOC sequence (Fig. 1b) indi- 
cates abundant cationic forms of a benzene ring, phenyl (CsH5*, 77 u) 
and benzenium (CsH;*, 79 u). The high abundance of these cations 
is highly diagnostic because the energetically favourable cationic aro- 
matic structure would be tropylium cations!®*° (C;H;*, 91 u) (Fig. 1a). 
Formation of tropylium cations must thus be inhibited, as it requires 
aromatic precursor molecules without hydrogenated C-atoms (for 
example, alkyl groups) attached to the ring”! (see Fig. 2, Extended 
Data Fig. 7 and Methods section ‘Inferring the origin of HMOC peaks 
in CDA spectra). 

A high abundance of aromatic structures is further supported by 
mass lines at 63 u-65 u (C;H sags 51 u-53 u (C,H tig) and 39 u (C3H3) 
(Fig. 1b, Extended Data Fig. 1), which can be interpreted as coincident 
unsaturated benzene fragments”! (Fig. 2, Extended Data Fig. 7). 
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Fig. 1 | Co-added CDA HMOC spectrum and mass line histogram. 

a, Time-of-flight spectrum representing the average of 64 high-quality 
spectra (see Methods, ‘Selection of 64 high-quality spectra for Fig. 1 and 
Extended Data Figs. 1 and 3’). The amplitudes of individual spectra were 
normalized and co-added. The corresponding masses of the peaks are 
labelled. The spectrum provides a representation of the average abundance 
of cation species. All signatures that are not colour-shaded are exclusively 
or mostly due to organic cations, as described in the text. Mass lines of 
exclusively inorganic origin are H;O* at 19 u and Na* at 23 u. Mass lines 
where interference with inorganic cations is likely are: H}O-H30* (37 u), 
H,O-Na + (41 u), (H2O).-H30* (55 u) and (H20)3-H30* (73 u). The 
CDAs target material, rhodium, and its water cluster forms cations at 103 u 
and 121 u (Rht, Rh-H,07*), respectively, that interfere with the HMOC 
pattern, but only in fast impacts (see Extended Data Fig. 3 for a 
comparison of fast versus slow impacts and Extended Data Fig. 4 for 
examples of individual CDA spectra; for a semi-quantitative overview of 
spectral features in individual spectra in the dataset, see Extended Data 
Fig. 1b). b, Occurrences (‘counts’) of resolved mass lines and ‘flank’ peaks 
in 64 high-quality HMOC spectra. In contrast to the spectrum shown in a, 


However, aliphatic fragmentation is also indicated by strong peaks at 
27 u-29 u (C,H3,;) and 41 u-43 u (C;H¢,,) anda less-pronounced 
peak at 15u (CH;). Mass lines at 55 u-57 u are in agreement with 
aliphatic Cy species (C,H? 4), whereas aliphatic structures with more 
than four carbon atoms (for example, C5H9 10,1; With 69 u-71 u) are 
generally absent (Fig. 1). 


140 1500 160 17 180 190 


the histogram makes no distinction of the peak amplitude and shows only 
the frequency of occurrence of resolved mass lines. Above 80 u the 
characteristic HMOC mass lines appear, preceded by the peaks from 
benzene-derived cations at 77 u and 79 u. A peak that can be identified at 
around 95 u in some spectra is in agreement with phenyl cations (C.H5*, 
77 u) forming a water cluster (CsHs-H20°*), preferentially at lower-impact 
speeds (Extended Data Figs. 1, 3). Organic mass lines below 45 u appear 
preferentially at odd masses, at 15 u, 27 u, 29 u, 39 u, 41 uand 43 u, indicating 
a preference for odd numbers of H atoms with the typical cationic 
hydrocarbon fragmentation fingerprint (CH{, C,H}, C,H?,C,H?, 

C,H? and C,H}). Although other interpretations for each individual 

line are possible (for example, HCN* (27 u), COH* (29 u) and CH3;CO* 
(43 u)), the overall pattern here suggests hydrocarbons or hydrocarbon 
fragments. In b all signatures with possible major contributions from 
inorganic species are colour-shaded as in a and marked by green triangles. 
In both panels, the absolute masses have an intrinsic uncertainty (absolute 
value) of +1 u at 80 u and +2 wat 180 u due to the limited calibration 
accuracy of the CDA in this high-mass regime. The mass intervals between 
peaks, however, are accurate to the integer level. 


Each spectrum also exhibits water-cluster cations of the form 
H,O(H,0) , (Fig. 1), typical for water ice impacts!*!”. Evidently, an 
ice-organic mixture constitutes the bulk composition of these particles. 
Ion abundances are indicative of an organic fraction in ice grains up to 
the per cent level (Extended Data Fig. 7). Particle radii are mostly 
between 0.2 jm and 2 j1m (Extended Data Table 1). Na* ions and 
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Fig. 2 | Formation of different aromatic cations. Although signatures 

in agreement with tropylium ions at ~91 u are present in every HMOC 
spectrum, they are on average about three times less abundant than the 
energetically favourable phenyl cation peak (Fig. 1a). If, upon breakup, 
the molecular structure allowed for an integration of a C-H group (top) 
into the highly stable aromatic resonance structure of the tropylium ion 
(91 u), then the formation of phenyl cations (77 u) would be suppressed 
(Extended Data Fig. 2). Only if the macromolecular structure does not 
allow this (bottom), phenyl (77 u) and benzenium (79 u, not shown) 
cations are the preferred single-ringed cations (also Extended Data Fig. 7). 
We note that fused benzene rings (such as PAHs) do not form either of 
these abundant single-ringed species (see Methods section ‘Inferring 

the origin of HMOC peaks in CDA spectra’ and Extended Data Fig. 8). 
Subsequent losses of neutral acetylene (C2H2) from phenyl and tropylium 
cations lead to smaller cationic fragments at 65 u, 39 uand 51 u?!, which 
are also seen in HMOC spectra (Fig. 1b). 


sodium-water clusters (Na(H,0) * *) appear in most HMOC spectra, 
but at a much lower level than in spectra of type-3 grains, which ng 
thought to be generated from frozen spray of Enceladus’ salty ocean® 

In HMOC-type spectra the Na-to-water ratio is similar to the low abun- 
dance found in type-1 particles thought to condense from salt-poor 
water vapour®?, 

Cassini’s INMS has measured the integrated composition 
of the Enceladean plume at several flyby speeds’. In contrast 
to the CDA, which records cations that form upon impact, the 
INMS simultaneously measures the composition of neutral gas 
entering the instrument aperture and volatile neutral molecules 
that are generated upon the impact of ice grains onto the instru- 
ment’s antechamber”’ (see Methods, ‘INMS data analysis’). There is a 
striking overabundance of organic species in spectra obtained 
at high flyby speeds (14-18 km s~') compared to those obtained at 
slower velocities (7-8 km s~') that we attribute to fragmentation of 
large organic parent molecules beyond the upper INMS mass limit 
of 99 u (Fig. 3). Given the low temperatures, species above 99 u are 
expected to be extremely depleted in the plume gas and originate from 
ice grains, probably of HMOC-type, entering the INMS aperture at 
high speed. 

Mass lines at 77 u-78 u stand out, in full agreement with the CDAs 
inference of benzene species. Because these are only apparent at high 
flyby speed, they cannot stem from benzene itself but, like the other 
species in the residual spectrum, must be fragments from larger par- 
ent molecules. As in the case of the CDA, some of the smaller require 
oxygen-bearing fragments. Further INMS compositional analysis of 
signals extracted explicitly from ice grain impacts (see Extended Data 
Fig. 10 and Methods section ‘INMS ice grain spectrum’) provides fur- 
ther evidence that CO is the dominant fragment species at 28 u and 
that a N-bearing fragment (C2H3N) might be present. 

Despite the relatively low mass range and resolution of the Cassini 
mass spectrometers, the measurements lead to the following key con- 
straints. (1) The HMOC pattern in the CDA spectra and the extended 
spectra support the presence of organic molecules with masses clearly 
above 200 u. (2) The typical spacing of 12 u-13 u between HMOC peaks 
implies unsaturated cationic fragments with a ratio of C/H ~ 2. Impact 
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Fig. 3 | INMS organic fragmentation spectrum. By subtracting a plume- 
integrated INMS mass spectrum acquired at low velocities from a high- 
velocity spectrum, we obtain this residual signal (in arbitrary units), which 
we attribute to breakup products from high-mass species beyond the 
INMS mass limit at 99 u (Methods ‘Residual spectrum from fast versus 
slow flybys’; Extended Data Fig. 11). Abundances of these high-speed 
fragments are normalized to the abundance of CH, (via its CH} ion at 

15 u, which therefore has no signal in the residual spectrum). The species 
at 77 u and 78 u match the benzene-derived cations at 77 u and 79 u in the 
CDA spectrum. In contrast to the CDA, the INMS measures the 
composition of the neutrals and thus the neutral benzene ring (78 u) is 
expected to have the strongest signal in this spectrum. Many of the more 
abundant low-mass species are in agreement with aromatic fragmentation 
and oxygen-bearing parent molecules. Additional mass lines only seen at 
high-speed flybys (50 u-56 u) and much elevated signals in other masses 
(37 u-42 u and 25 u-28 u) indicate the unsaturated fragments C4H>_-s, 
C3Hj)-¢ and C)H}-4, consistent with aromatic breakup products. Aliphatic 
fragmentation is required for the high-mass end of the C2 and C; regions 
(29 u-30 u and 43 u). The overabundance at 28 u (CO), 30 u (probably 
H2CO) and 31 u (probably CH30H) requires a contribution of oxygen- 
bearing fragments, consistent with CDA observations. 


experiments with polystyrene'®!° (Extended Data Fig. 2) indicate that 
the C/H ratio of the parent molecules might be lower (C/H ~ 1) than 
in these fragments. (3) Prominent benzene-like species in the CDA 
and INMS spectra indicate the presence of abundant sub-structures of 
isolated benzene rings. From the INMS spectra, these features cannot 
originate from benzene itself, but are fragments of larger molecules. 
However, polycyclic aromatic hydrocarbon (PAH)-like fused rings do 
not form such fragments. Although not unique, the most parsimonious 
interpretation of these results is that these aromatic structures are parts 
of the same massive parent molecules that are responsible for HMOCs, 
which would explain their unsaturated nature. (4) From the suppressed 
formation of tropylium cations observed in the CDA spectra, we con- 
clude that the rings are either connected to functional groups without 
carbon atoms or to dehydrogenated carbon atoms. (5) Unsaturated 
species at low masses are in close agreement with aromatic fragmenta- 
tion. Aliphatic cations indicate saturated aliphatic structures with one 
to four C atoms arranged in parallel with the unsaturated (aromatic) 
structures. (6) Oxygen-bearing species in both the CDA and INMS 
spectra probably originate from hydroxyl, ethoxy or carbonyl func- 
tional groups. Nitrogen-bearing species are in good agreement with 
some features but do not provide a unique interpretation. Although 
no indication of other elements is observed in the organic structures, 
these cannot be ruled out. 

Fragmentation of a single type of macromolecule may be respon- 
sible for all mass lines in the HMOC spectra. The parent substance 
would then be composed of cross-linked or polymerized aromatic and 
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aliphatic substructures with functional groups containing oxygen and 
probably nitrogen. Alternatively, contributions from lower-mass species 
might be mixed with macromolecules. 

The Enceladean origin of these macromolecules is evident 
(see Methods sections ‘Enceladus as the origin of HMOC parent mol- 
ecules and exclusion of other potential sources’ and ‘Contamination 
of INMS spectra from previous measurements is unlikely’), and there 
is a multitude of speculative options for the genesis of these complex 
organics on Enceladus, some of which are outlined in Methods sec- 
tion ‘Possible precursor scenarios for the observed complex organics. 
However, the data do provide evidence of the formation of highly 
organic-enriched ice grains under quite specific conditions. Ice grains 
of very different salinity emerging from Enceladus have previously been 
identified in CDA mass spectra. Nearly pure water ice grains (type 1) 
can form from condensation of supersaturated vapour inside and above 
the ice vents®**4, In contrast, salt-rich ice grains (type 3) are thought to 
be frozen ocean spray*”, generated when bubbles of volatile gas (CO2”, 
CH, or H,!”) reach the water table and burst”®. The HMOC-producing 
organic material (see Methods, ‘Enceladus as the origin of HMOC par- 
ent molecules and exclusion of other potential sources’) is detected in 
type-2 salt-poor ice grains, and thus cannot have formed directly from 
the salty ocean spray that preserves the composition of ocean water 
upon flash freezing’. Consequently, the observed organic compounds 
were not dissolved in ocean water when incorporated into ice grains. 
However, molecules with masses much larger than 200 u should not 
exist in the gas phase near the water table (T' < 0 °C); hence, the organic 
substances cannot have condensed from the vapour either, and must 
have been solid when the grains formed. 

The most plausible way to generate the observed grains is if the 
organic material exists as a separate phase, such as a thin film or layer 
of mostly refractory, insoluble organic species on top of (at least parts 
of) the oceanic water table located inside water-filled cracks in the ice 
crust (see Extended Data Fig. 12 and Methods section “Deduction of 
an organic enriched layer at the Enceladean water table’). Bursting 
bubbles then disperse the organic film and, besides salty water drop- 
lets, produce droplets or flakes of organic material. When ascending 
in the icy vents, the organics become coated by water ice condensing 
from the vapour carrying the grains (Extended Data Fig. 12). Indeed, 
the limited tendency of benzene cations in HMOC spectra to cluster 
with neutral water molecules disfavours intimate mixing of organics 
with water and implies a core-shell grain structure. In a well mixed 
system, such a cluster at 95 u is much more pronounced (Extended 
Data Fig. 7). 

Aerosol formation from bubble bursting is a well studied process in 
Earth's oceans, which are covered by an organic microlayer”®. Organic- 
bearing sea spray serves as highly efficient nucleation cores of ice clouds 
over polar waters on Earth”’ and is found preferentially in the smallest 
aerosols, between 40 nm and 250 nm in size*®8, whereas larger aerosols 
with sizes between 500 nm and 1,000 nm are either organics mixed 
with salt or do not contain organics”. Under Enceladus’ low-gravity 
conditions, one would expect larger gas bubbles and therefore larger 
film drops. Spectra showing HMOCs are generated by ice grains with 
radii of around 1 j1m (Extended Data Table 1). The presence of insolu- 
ble organic nucleation cores, a few hundred nanometres in size, would 
naturally explain the high organic content in HMOC grains. Larger, 
organic-free salty ocean droplets account for the salt-rich type-3 parti- 
cles detected by the CDA. Bubbles ascend through tens of kilometres of 
ocean before reaching the surface. Like in Earth’s oceans, organic sub- 
stances can accumulate efficiently on the bubble walls*’, thus probing 
the oceanic organic inventory at depth (see Methods, ‘Deduction of an 
organic enriched layer at the Enceladean water table’). 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0246-4. 
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METHODS 

CDA data analysis. Short description of the CDA’ chemical analyser subsystem. The 
chemical analyser is the subsystem of the CDA*" that provides chemical informa- 
tion about an impacting dust particle. Depending on its trajectory, the particle 
either hits the central rhodium target (chemical analyser target, CAT; diameter, 
0.17 m), the surrounding gold target (diameter, 0.41 m) or the inner wall of the 
instrument. This work only deals with impacts on the CAT. If a dust particle 
impacts the CAT with sufficient energy, it becomes totally vaporized and partly 
ionized, forming an impact plasma consisting of target and particle ions, together 
with electrons and neutral molecules and atoms. The instrument separates this 
plasma, the positive component of which is linearly accelerated towards a multi- 
plier about 19 cm away from the CAT, which is used to generate a TOF spectrum. 
The spectrometer is sensitive to positive ions only. The mass resolution, m/Am, 
which is derived from laboratory experiments with the instrument, depends on 
the atomic masses of the ions. At 1 u, m/Am is 10, increasing to m/Am = 30 at 
100 u and up to m/Am = 50 at 190 u—although these values vary strongly with 
impact conditions. 

Acquisition of a spectrum in the high-rate sampling mode of the multiplier can 
be triggered by charge thresholds being exceeded on either the CAT or the multi- 
plier. In the latter case, recording can be triggered up to several microseconds after 
the actual impact by the arrival of an abundant ion species; for water-dominated 
particles, these generally are hydrogen cations (H*) or hydronium ions (H30*). 
If triggered by the arrival of H3O*, mass lines of species with masses lower than 
19 u, as well as the hydronium signature itself, do not appear in the spectrum. In 
this work, all spectra were triggered either directly by impact or by registering the 
Ht signal at the multiplier. 

The spectra are logarithmically amplified, digitized at 8-bit resolution and 
sampled at 100 MHz for a period of 6.4 1s after the trigger. The recording period 
of the high-rate sampling mode allows the detection of ions with masses of up 
to approximately 185 u, assuming that instrument recording is triggered by the 
impact itself, and about 215 u when recording is triggered by Ht. The acquisition 
of the extended spectrum starts after the high-sampling mode ends at 6.4 j1s and 
is recorded with a sampling rate of 10 MHz up to a total time of 40 js. The low 
number of data points corresponds to an extremely low mass resolution in the 
extended spectrum: one data point interval represents 7 u at 300 u and 20 u at 
2,000 u. The extended spectrum frequently shows an instrument-artefact peak at 
6.8 1s, which is ignored in our analysis. 

Since the TOF is proportional to the square root of the mass-to-charge ratio of 
ions, in the ideal case its spectrum also represents a mass spectrum for identical 
ion charges. The ions created by impact ionization in the impact speed regime 
considered here (4-18 km s~') are almost exclusively singly charged. Unfortunately, 
the TOF is also influenced by the broad distribution of initial ion velocities, slightly 
varying flight paths and plasma shielding effects”. For that reason, species with 
identical masses are distributed over a range of sampling points and the mass 
resolution drops below integer values usually around 25 u-30 u. Therefore, we 
prefer to display original CDA spectra using a horizontal axis that shows the TOF, 
not the mass. The latter could incorrectly suggest an unrealistically high accuracy. 

For a detailed description of the instrument, see Srama et al.3". The calibration 
routine is described in Postberg et al.'*. For the dataset used in this work the 
calibration was done by using water and sodium mass lines, and a stretch factor 
of a = 474 ns was applied. 

Dataset. All data used in this work are listed in Extended Data Table 1 and, 
like all CDA data, are archived on the Small Bodies Node of the Planetary Data 
System (PDS-SBN), at http://sbn.psi.edu/archive/cocda. We used 15 periods with 
good pointing of the CDA towards the E-ring dust ram direction between 2004 
and 2008. This early period was chosen because there was negligible CDA con- 
tamination from salts deposited on the CDA impact target during deep Enceladus 
plume dives!”. These 15 following periods (expressed in seconds), as recorded by 
the spacecraft clock, are: 
1,477,548,025-1,477,746,968 
1,489,004,979-1,489,118,561 
1,498,469,760-1,498,505,448 
1,506,223,870-1,506,436,625 
1,509,311,996-1,509,357,627 
1,511,686,278-1,512,480,582 
1,514,090,477-1,514,186,815 
1,515,560,669-1,516,328,727 
1,519,522,033-1,519,612,908 
1,521,596,023-1,522,361,262 
1,543,798,083-1,543,831,204 
1,544,838,694—-1,544,916,311 
1,557,517,652-1,557,520,425 
1,589,065,736-1,589,083,623 
1,605,559,719-1,605,566,052 


LETTER 


Within these periods, a total of 7,353 spectra of impinging E-ring grains were 
investigated. A Lee filter was applied for better signal-to-noise ratio. Within the 
group of organic-bearing spectra (type 2®*"“) we find a subgroup of spectra with a 
pattern of repetitive non-water peaks above 80 u (Fig. 1). We attribute this spectral 
pattern to HMOCs. The selection criteria are the following. 

(1) At least five roughly well defined, equidistant peaks above 80 u with a signal- 
to-noise significance greater than 20. 

(2) Water cluster peaks of the form (H2O),X* (where X* is H3;O* or Rh*, for 
example) are explicitly not considered for selection. Cases in which an HMOC 
pattern could be verified despite interference with water clusters were selected, but 
not considered for the sub-selection high-quality spectra (see Methods, ‘Selection 
of 64 high-quality spectra for Fig. 1 and Extended Data Figs. 1 and 3’). 

(3) Isolated or irregular peaks above 80 u are not considered. 

The much wider group of CDA type-2 grains! includes a multitude of organic- 
bearing ice grains observed in the plume and in the E ring. Other type-2 spectra 
not belonging to the type-2 subtype displaying HMOC patterns are outside the 
scope of this work. 

Relative frequency of HMOC-type grains depends on impact speed and distance 
to Enceladus orbit. We find 83 spectra (1.1%) with apparent HMOCs, equivalent to 
about 3% of all type-2 spectra in the dataset (Extended Data Table 1). However, the 
probability of HMOC detection depends on the impact speed and thus the energy 
densities that the organic-bearing ice particles were exposed to upon impact. At 
high speeds (8-15 km s~') the overall frequency of HMOC detection is about 3% 
of all particles (~10% of type-2 spectra). At 6-8 km s~! the HMOC detection fre- 
quency drops sharply to about 1% and goes down to 0.4% between 5 km s~! and 
6 km s~! (see Extended Data Table 1). Below 5 km s~! no HMOC was detected, 
equivalent to a probability of less than 0.1%. In the few spectra in the dataset 
recorded at speeds above 15 km s~!, no HMOCs were detected (equivalent to a 
probability of <1.7%), so the speed range 8-15 km s~ seems to be optimal for the 
CDA to record HMOC spectra. At speeds below 8 km s“! the probability to pro- 
duce fragments from the proposed parent molecules (outside the CDA mass range) 
decreases and becomes close to zero below 5 km s~!. At speeds above 15 kms"! 
the detection probability seems to decrease again, probably because at these high- 
impact speeds mostly small fragments with masses <80 u are produced, whereas 
the production of fragments that yield the typical HMOC pattern between about 
80 uand 200 u (the defining feature of this CDA spectrum type) becomes less likely. 

HMOC grains were detected in the plume (Extended Data Fig. 6) and in the 
E ring in a wide range of Saturnian distances from 3.8Rs to 14Rs, where Rs is 
Saturn's radius, except in its outermost fringe, which extends* out to 20Rs. 
However, the fraction of HMOC-type grains in the E ring increases closer to 
Enceladus’ orbit. The chance to encounter an HMOC grain with intermediate 
impact speed of 6-8 km s_! is about 2.5% at a Saturnian distance of 3.8Rs—5Rs 
and drops to 1.5% at 5Rs-7Rs. Beyond 7Rsg, the detection probability decreases 
sharply to 0.5% and is 0.3% beyond 11Rs. This clearly points at an Enceladus origin 
(see ‘Enceladus as the origin of HMOC parent molecules and exclusion of other 
potential sources’). 

The effects of speed and distance are cumulative: in the optimal speed window 
and close to the orbit of Enceladus, the detection rate is highest, at about 4%. Thus, 
we can conclude that the actual fraction of ice grains carrying substantial amounts 
of complex organic material ejected into the E ring is at least 4% of the grains in the 
size range covered by the CDA (grain radius r ~ 0.2-2 jm). An instrument with 
an extended mass range (for example, up to 2,000 u) and higher resolving power 
than the CDA would probably detect complex organics with similar or higher 
probability at encounter speeds of ~5 km s_! with the organic parent molecules 
being largely intact. 

Selection of 64 high-quality spectra for Fig. 1 and Extended Data Figs. 1 and 3. 
For the peak statistics and co-added spectra in Fig. 1 and Extended Data Fig. 3 and 
the semi-quantitative spectra depicted in Extended Data Fig. 1, we used a subset 
of 64 spectra that had the highest quality. The 19 spectra that were not selected 
either had: 

(i) an extended water-cluster pattern (for example, (H2O),-H30* or (H2O),- 
Rh*), where strong peaks overlapped with the HMOC region above 100 u, or 

(ii) an HMOC pattern with a low signal-to-noise ratio and poor peak definition 
that did not allow a reliable quantification of all features, or 

(iii) an irregular baseline (instrument artefact) that did not allow a reliable quan- 
tification and would have spoiled the quality of the co-added spectra, or 

(iv) an uncertain calibration, which did not allow an unambiguous assignment 
of peak masses. 

In cases (i) and (ii), the organic fraction in the ice grains is probably lower than 
in other HMOC spectra. 

In Extended Data Fig. 3 the dataset is split into ‘fast’ and ‘slow’ impacts (above 
and below ~10 km s“1, respectively). The fast co-added spectrum is composed of 
12 of the 64 high-quality HMOC spectra recorded after an impact with a velocity 
higher than ~10 km s~1. In this case the speed was determined by the presence of 
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one or more hydrogen cations (H+, H}, H+). The uncertainty of this method is 
clearly below the uncertainty achieved when using assumptions on the orbital 
elements, shown in Extended Data Table 1. The appearance of H* cations is a 
reliable indicator of impact speeds!” in excess of about 12 km s~!, and H} and 
H} cations appear already at slightly lower speeds (>10 km s~!)!”. For 12 spectra 
in which one or more of these hydrogen mass lines were present, this reliable speed 
threshold indicator was used. The remaining high-quality spectra were co-added 
to produce the slow spectrum in Extended Data Fig. 3. Owing to their higher 
kinetic energy upon impact, low-mass cationic fragmentation species are more 
abundant there. Moreover, fast spectra were on average triggered earlier, mostly 
directly after impact (see ‘Short description of CDAs chemical analyser subsysten). 
Therefore, the high-rate recording interval of 6.4 1s stopped at a lower equivalent 
mass than for the lower-speed spectra (Extended Data Fig. 1), and the HMOC peak 
at ~191 u was often not recorded for fast spectra. 

Inferring the origin of HMOC peaks in CDA spectra. In the main text we argue 
that the HMOC pattern is indicative of highly unsaturated organic cations. A very 
similar pattern was also observed in CDA ground calibration experiments that 
used a dust accelerator** to produce impacts from micrometre-sized polystyrene 
beads (see figures 10 and 13 in Goldsworthy et al.!*). The accelerator experiment 
was later repeated using an advanced TOF dust analyser with higher mass resolu- 
tion!® (Extended Data Fig. 2), giving identical results. A similar, but not identical, 
unsaturated cationic fragmentation pattern was observed in TOF-SIMS (second- 
ary ion mass spectrometry) experiments with polystyrene*’. The most plausible 
explanation for the fragmentation pattern and the ionic species observed is given 
in table 2 of ref. *”. Although the spacing of 12.5 u in the HMOC pattern seems to 
suggest a C/H ratio of 2 or even higher, all three polystyrene experiments show 
that such an unsaturated cationic fragmentation pattern can also be achieved with 
parent molecules with a C/H ratio of only 1. These analogue experiments demon- 
strate that the C/H ratio of the cationic fragments could be different from that of 
the bulk material. 

However, there is a noticeable difference between polystyrene spectra and 
HMOC spectra: in contrast to HMOC spectra, tropylium ions (91 u) form abun- 
dantly in impact ionization polystyrene spectra, whereas phenyl cations (77 u and 
79 u) are relatively depleted. In Fig. 2 we demonstrate the reason: in polystyrene, 
polymer alkyl groups are attached to each of the aromatic monomers, which allow 
easy formation of tropylium ions at 91 u (Extended Data Fig. 2). This is also known 
from classical electron ionization mass spectrometry: abundant phenyl cations 
form only if the formation of tropylium ions is sterically difficult or impossible”??!. 
This leads to our conclusion that for HMOC parent species, single-ringed substruc- 
tures are predominately attached to dehydrogenated carbon atoms or to functional 
groups without carbon. 

In the dust accelerator experiments with polymers, an HMOC-like pattern was 
observed at impact speeds!*!? of 4-6.5 km s~!. Above this speed further fragmen- 
tation occurred. By contrast, CDA HMOC spectra are observed at impact speeds 
of about 5-15 km s~!. The water ice matrix in which the HMOC parent molecules 
are embedded might explain this difference. It is well known from matrix-assisted 
laser desorption/ionization experiments that water protects large organic molecules 
from fragmentation very efficiently. 

An alternative interpretation of the origin of HMOCs might be very large PAHs. 
However, in our laser-assisted analogue experiment with pyrene in a water matrix 
(see Extended Data Figs. 8, 9 and Methods section ‘Laser dispersion analogue 
experiments for icy dust impacts’), we do not observe efficient fragmentation of 
the PAH. These findings generally agree with information from electron ionization 
(70 eV) mass spectrometry of a wide range of PAHs. By contrast, Stephan et al.** 
demonstrated a pattern similar to the one observed with the CDA above 100 u 
using TOF-SIMS of a mixture of high-mass PAHs. However, the applied energies 
were much higher (25 keV) and the PAHs were not embedded in a water matrix, 
which naturally protects organic molecules from breakup*. In any case, regardless 
of fragmentation at harsh ionization conditions, PAHs do not yield any monoar- 
omatic peaks; hence, neither phenyl (77 u) nor tropylium (91 u) cations form*”. 
Therefore, if PAHs have a role in the origin of HMOCs, other parent molecules 
must be responsible for the strong monoaromatic peak in the CDA and INMS 
spectra. 

In principle, monoaromatic cations in the CDA spectra (77 u, 79 u) could origi- 
nate from benzene mixed within the macromolecular structure responsible for the 
HMOCs. However, this is not possible for the benzene feature (78 u) seen in the 
INMS spectra because they exclusively appear at high-speed flybys and thus must 
be fragments of larger molecules. It is plausible that the monoaromatic features 
observed in both instruments originate from HMOC-type ice grains (see next 
section) and therefore also in CDA spectra we consider molecular fragmentation 
as the most likely source. 

INMS data analysis. Residual spectrum from fast versus slow flybys. The INMS*! 
measurement routinely integrates the composition of plume gas over the flyby to 
increase the signal-to-noise ratio. Using this methodology, volatile compounds 


released from ice grains that fragment and evaporate upon impact onto the instru- 
ment’s antechamber?’ cannot be discriminated. In contrast to the CDA, which 
records cations, the INMS observes volatile neutral molecules that are in the vapour 
phase or are generated by ice grain impacts in the plume, and are subsequently 
ionized by the impact of 70-eV electrons in the ion source. 

The residual spectrum presented in Fig. 3 represents the difference in com- 
position and overabundance of organic material observed by the INMS during 
high-velocity flybys compared to low-velocity flybys through the Enceladus plume 
(Extended Data Fig. 11). Our interpretation of this spectrum is that low-speed 
flybys result in minimal fragmentation of the organic compounds within the ice 
grains. By contrast, high-speed impacts lead to fragmentation of large organic 
parent molecules beyond the INMS mass range of 99 u. Species with molecular 
masses above 99 u in the plume are expected to be extremely depleted in the gas 
phase and have to reside inside or on the ice grains. Based on the knowledge of 
the compositionally different ice grain populations that the CDA has collected 
over many years, HMOC-type grains entering the INMS’s aperture are the most 
plausible source. The similarity in the composition of the fragments seen by the 
CDA and the INMS reinforce this assertion. 

To obtain the spectrum shown in Fig. 3 we took the summed spectrum of the 
E5 flyby'® and subtracted the average summed spectrum of the E14, E17 and E18 
flybys. These represent the fastest (E5 at ~18 km s~!) and slowest (E14, E17 and 
E18 at ~8 km s_') flybys through the plume for which the INMS has the best data. 
The three slow flybys have been combined into an average spectrum to compensate 
for their lower signal-to-noise ratio. Velocity-driven processes complicate the spec- 
tral comparison. HO-Ti reactions in the INMS antechamber increase with veloc- 
ity owing to vaporization of Ti from the walls of the instrument’s antechamber and 
create excess H, from H,O, leaving TiO and TiO; as by-products). Likewise, CO, 
fragments to become CO more readily as the velocity increases. Lastly, fragmen- 
tation products of high-mass organics probably enhance the plume gas signal for 
all masses dominated by organics in the INMS mass range. This makes it difficult 
to set an accurate reference point with which to compare the high- and low-veloc- 
ity spectra. We have chosen to use the signal at 15 u in both the high- and low-ve- 
locity spectra as the common reference point because the CH} dissociative ion 
from CH, dominates the signal at this mass; furthermore, CH, is considered to be 
a plume-derived volatile species not considerably altered by fragmentation of high- 
mass organics. Choosing a mass of 15 u as the common reference point removes 
all the signals at this mass for the residual spectra, whereas in all other masses the 
signal is positive or negative. After matching the two spectra at their 15 u signal 
(Extended Data Fig. 11) and subtracting the low-velocity spectrum from the 
high-velocity spectrum, we set the amplitude scale of the residual relative to the 
noise floor of the low-velocity flybys (about 5 orders of magnitude lower than the 
HO signal at 18 u) so that only residual signal with a high signal-to-noise ratio is 
visible. The residual signal presented in Fig. 3 can be considered to be a conserv- 
ative estimate of the high-mass organic fragmentation effect; if the CH, in the 
high-velocity flyby was increased from the low-velocity flybys, the residual signal 
would be even larger in amplitude. 

Hydrocarbon species observed in this conservative estimate of the residual 
signal are discussed in the main text. Moreover, the residual spectrum suggests 
that at least 50% of the 28 u peak is from CO, given the upper limits placed upon 
N> and C,H, in the fast E5 spectrum!°. The oxygen-bearing species H»CO and 
CH30OH are suggested as the likely sources of the residual signal at 30 u and 31 u 
because they are the dominant species for these peaks in the E5 spectrum! and 
the residual retains more than 60% of the original E5 signal. This interpretation is 
strongly supported by the INMS ice grain spectrum. 

INMS ice grain spectrum. To isolate spectral signals from icy grains when cross- 
ing the plume, we extracted those parts of the high-time-resolution, real-time 
INMS plume spectra (every 2.3 s) that registered the ice grains as spikes in the 
closed-source neutral (CSN) data”>?. The quadrupole mass analyser has a mass 
step integration period of 34 ms, whereas the vaporized gas from the impact dis- 
sipates in the antechamber in less than 3 ms. Therefore, the ice impacts appear as 
gas bursts or ‘spikes’ superimposed on the smoothly varying gas spectra that are 
mass-filtered by the mass step of the analyser. Thus, each spike can register only 
one unity-mass location per ice impact. By summing the spikes over three similar 
encounters (E14, E17 and E18) there are sufficient mass sampling statistics over 
the full sampled mass range to construct a pseudo-mass spectrum that represents 
the composition of the micrometre-sized grains. 

Extended Data Fig. 10 shows this pseudo-mass spectrum of the grains meas- 
ured by the INMS within 100 s of the time of closest approach. Each of the three 
encounters provided approximately 200 individual measurements or spikes 
to the composite spectrum. The vertical axis represents the relative count rate 
for each mass, because the counts for each mass are adjusted for the number of 
measurement opportunities (integration periods) for each mass. The scale of the 
plot is adjusted so that the minimum count rate is 1 count per integration period. 
The dispersion in the measurements obtained in the three encounters represents 
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the uncertainty on the count rates. The grain measurements are heavily concen- 
trated near the closest approach, peaking with more than one spike per every ten 
integration periods at 2 u, which is the mass most sensitive to the grains. Analysis 
of the ice grain spectrum reveals a composition related to, but distinct from, the 
plume gas. 

Two observations are particularly relevant to the high-mass organic material 
presented in this work (Extended Data Fig. 10). First is the clear dominance of CO 
at 28 u and the lack of No. By applying the well known quantities of dissociative 
species from the impact of 70-eV electrons in the INMS ionization chamber, CO, 
accounts for ~3% of the 28 u signal. N> has an upper limit of ~10% of the 28 u 
signal owing to the low signal at 14 u (N* dissociative ion) and may not be present 
at all. If present, CH, is limited to less than ~6% of the 28 u signal. This leaves a 
requirement for CO to be the dominant species at 28 u, which matches well the 
signal at 12 uand 14 u (C* and CO** dissociative peaks). Although not a unique 
source, CO may be released from carbonyls in HMOC parent compounds by decar- 
bonylation reactions during high-speed impacts of ice grains in the INMS. The 
caveat here is that the INMS integrates over all ice grains detected in the plume. 
Therefore, CO from carbonates or bicarbonates found in type-3 grains”"! might 
contribute here as well. The second observation is the likely presence of C,H3N. 
The signal pattern at 39 u-42 u suggests a species with its highest signal peak at 41 u 
and lower signal in the surrounding masses. C2H3N is the best candidate because 
it matches this ‘stair step’ pattern extremely well. C,H3N also appears to be present 
in the plume gas spectra, possibly because of contamination from a distribution of 
small grains indistinguishable from the incoming gas. 

We note that in contrast to the spectrum in Fig. 3, these ice grain spectra were 

recorded at slow flyby speeds and show no signal with sufficient signal-to-noise 
ratio above 50 u. Unfortunately, ice grain spikes at the high-velocity flyby (E5) 
are not abundant enough to compose a similar spectrum, as shown in Extended 
Data Fig. 10. 
Deduction of an organic enriched layer at the Enceladean water table. The 
observation of high concentrations of solid material in a water ice matrix that 
does not represent the ocean's salinity but is emitted from a saline ocean source, 
requires specific conditions during the formation of these grains. Because of the 
low temperatures near the water table where the ocean water is in contact with the 
ice crust, we conclude that the HMOC parent species are solid. The fact that the 
organics are embedded in a salt-poor water ice matrix indicates that they mostly 
are poorly soluble in water. In principle, primordial refractory organic substances 
trapped in the ice crust near the cracks, which become mobilized by the ascending 
vapour, could serve as an organic source. However, we disfavour this scenario 
because the crack walls are continuously coated by fresh ice condensing from the 
vapour flow’, 

We rather promote a scenario that is well known from ice cloud formation 
over polar waters on Earth?”. There, organic aerosols of mostly biogenic origin”® 
produced by bubble bursting serve as highly efficient nucleation seeds. When bub- 
bles burst on Earth's oceans, an organic-free sea spray forms in parallel with pure 
organic aerosols and mixed phase organic-bearing sea spray”. The organic mass 
fraction of sea-spray aerosol has been consistently shown to be inversely related 
to aerosol size””**” . The purely organic end-members are found preferentially 
in the smallest aerosols”©”* and are mostly water-insoluble”**”. As shown in the 
main text, the size of the organic nucleation cores in oceanic sea spray matches the 
CDA observation quite well. 

Aerosol formation on Earth also provides a plausible analogue mechanism for 
the simultaneous production of salty ocean spray and organic aerosols: if the drop- 
lets are smaller than a few micrometres, then they can be thermally supported in 
water vapour with gas density slightly below the triple point against Enceladus’ 
gravity. They do not fall back into the liquid, but are carried upwards through 
the ice vents by gas from evaporating water, following the pressure gradient into 
space. In parallel to the formation of smaller organic aerosols, larger salty ocean 
droplets form, which are later detected by the CDA as salt-rich type-3 particles in 
the plume and in the E ring. The analogue from Earth would also predict some 
larger mixed-phase particles that carry both ocean salts and complex organics, 
but these have not been identified by the CDA. One explanation could be that the 
macromolecular HMOC parent substance might be hydrophobic and would thus 
naturally avoid forming mixed-phase organic-sea-water aerosols. 

On Earth, bubbles are generated mostly by breaking of waves”’. On Enceladus, 
several volatile gases (CO2, CH4, NH3 and Hz2) have been detected in substan- 
tial concentrations in the plume’? that will inevitably create bubbles when they 
rise though the ocean; these bubbles will then burst at the water table. In Earth’s 
oceans, bubbles not only produce an aerosol, but are also very efficient in ‘har- 
vesting’ organic molecules from the deeper oceanic environment by collecting 
these substances on their surfaces while ascending. The increase of relative organic 
concentrations observed near the surface of Earth’s oceans by 2 to 3 orders of mag- 
nitude**°° suggests a selective transport of organic matter from the bulk seawater 
to the water table and then from the microlayer to atmospheric aerosols?”??*. 
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Possible precursor scenarios for the observed complex organics. There are many 
ways to explain the presence of complex organic materials in an icy moon in the 
outer Solar System. The two general categories of origin are accretion of primordial 
material and endogenic synthesis. In the former hypothesis, the organic carbon on 
Enceladus would predate the formation of the moon, and Enceladus would have 
acquired an organic inventory via its building blocks (icy planetesimals). The latter 
hypothesis would require hydrothermal systems inside Enceladus’ porous rocky 
core’? to produce complex organic molecules from small molecule precursors. This 
category can be broken down into several sub-categories characterized by molecu- 
lar precursors. Organic compounds can be synthesized from more oxidized forms 
of carbon, such as CO», CO or formate***?. More reduced simple organic species 
(formaldehyde, methanol and HCN) can also serve as feedstock for the synthesis 
of more complex organic compounds™*». Methane is a relatively inert species, so 
it may be a less favourable carbon source, unless prolonged metamorphism or radi- 
ation chemistry is involved. Both abiotic and biotic processing of these precursors 
is possible. A mixture of spatially distinct (for example, ocean versus rocky core) 
sources is also not to be excluded at the present state of knowledge. 

As an example of endogenic synthesis, relatively oxidizing hydrothermal con- 
ditions may promote the conversion of simple primordial organics into reactive 
unsaturated compounds, such as quinones, (poly)phenols or aldehydes, which 
may in turn polymerize (possibly in the presence of catalytic minerals) to form 
relatively hydrogen-poor macromolecules. Macromolecules on Enceladus that 
contain aromatic units with connecting short aliphatic chains that include more 
or less oxidized functional groups may resemble some humic substances on Earth. 
On Earth, several pathways exist for the formation of humic substances during the 
decay of biogenic complex organic matter*°. However, these macromolecules may 
also be formed via synthetic routes, such as radical polymerization of phenolic 
compounds in the laboratory. Hanninen et al.*” showed that humic acid-like poly- 
mers can be synthesized from homogeneous, well defined starting materials under 
oxidative conditions. The polymers display clear signatures of phenolic, aromatic 
(olefinic) and carboxyl carbons, and carboxyl carbons are present regardless of 
whether the monomeric unit possesses a free carboxyl group or not. On this 
basis, it was concluded that a partial de-aromatization occurs during the oxida- 
tive polymerization of o- and p-diphenolic compounds. Hanninen et al.°” also 
reported that the °C nuclear magnetic resonance spectra and other features of 
the synthesized polymers resemble to a large extent the spectra of humic acids. 
The characteristics of these macromolecules are consistent with our observations 
from the HMOC spectra. 

One can also consider a primordial origin of organic materials, which would be 
the simplest possibility. If the rocky materials that were accreted by Enceladus were 
analogous to CI-, CM- or CR-group chondrites or refractory cometary solids, then 
a substantial organic inventory is inescapable. CI chondrites in particular contain 
~2 wt% insoluble organic carbon*. This insoluble organic matter (IOM)°*-® is 
considered to be partly of primordial origin (that is, inherited from the interstellar 
medium) and partly modified by early hydrothermal processing on carbonaceous 
chondrite parent bodies. These asteroidal small bodies or planetesimals accreted 
within a few million years after the Solar System formation, heated up as a result of 
radioactive decay (mainly from 7°Al) and persisted at elevated temperatures, caus- 
ing hydrothermal alteration and metamorphism for at least a few tens of millions 
of years after Solar System formation. While analytical data for comets are much 
more limited, recent results from comet 67P indicate a much higher organic carbon 
content of ~30 wt% in dust particles“*®, To put these extraterrestrial numbers into 
perspective, we note that hydrocarbon source rocks on Earth have an average total 
organic carbon value of ~2 wt%®. 

Enceladus organic matter may be similar to chondritic IOM. The latter shows 
some compositional variability: particularly in classes like CV and CO, which expe- 
rienced strong thermal metamorphism at temperatures of about 400 °C-600 °C, 
both the H/C and O/C ratios are lower (down to 0.1 and 0.05, respectively), whereas 
in more primitive classes (CI, CM, CR) mild aqueous activity (100 °C-200 °C) pre- 
vailed and allowed the preservation of O/C ratios as high as 0.23 and H/C ratios® of 
0.8. IOM in the CI, CM and CR classes is considered to be the most primitive type 
of IOM and has an average composition of Cy99H79022N357. It is macromolecular, 
and most of the carbon is incorporated in small aromatic structures, with about 
20%-30% of the carbon constituting aliphatic bonds. The maximum length is 7 
carbon atoms for aliphatic bridges between aromatic units and 4 carbon atoms for 
side chains with a free end®*!. Functional groups containing O, N and S are also 
abundant (see Fig. 2 in Remusat®!). These characteristics are consistent with our 
observations from the HMOC spectra. 

Because this subtype of IOM is thought to be the product of mild thermal and 
aqueous alteration, on the basis of present information it is impossible to say if this 
alteration already happened inside icy planetesimals under the influence of 7°Al 
heating or only later on Enceladus. There, under the influence of tens of millions 
of years of hydrothermal processing’’, more primitive primordial organics could 
have been transformed into a different organic mixture. 
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In the following, we will elaborate further on scenarios that invoke some level of 
processing of primitive organics on Enceladus. At a depth of about 60 km below the 
surface, the ocean comes in contact with Enceladus’ core. The core’s very low density 
(~2,500 kg m~*) suggests ~20% porosity”!*. Owing to the low pressures and mod- 
est temperatures in Enceladus interior, an unconsolidated core with such porosities 
can be inherited from the accretion and differentiation process. Tidal forces help to 
maintain this fragmented state so that ocean water can percolate through it, up to 
the present day. In Enceladus interior, temperatures of 100 °C or more (increasing 
towards the core centre) can be maintained over at least tens of millions of years. 
Depending on the permeability of the core, the entire ocean could be processed at 
temperatures higher than approximately 100 °C within! 25-250 Myr. 

While primordial oxygenated and nitrogenous species, such as alcohols, car- 
boxylic acids, amines and nitriles, may be more soluble in liquid water and can be 
leached into the subsurface ocean of Enceladus, larger polymeric organics would 
exist as a separate organic layer. If not heated above ~300 °C, these organics might 
remain largely unaltered over geological timescales. Intense interaction with water, 
however, might have oxygenated the macromolecules on one end to form micellar 
structures (hydrophilic exterior and hydrophobic interior) and preserved their 
structural integrity over the lifetime of Enceladus. 

Alternatively, a large accreted inventory of IOM-like material could establish 
the basis of a more evolved organic factory inside Enceladus’ rocky core, similar to 
oil- and gas-generating sedimentary basins on Earth. To make this factory opera- 
tional and enable consistency with the organic observations in the plume, organic 
compounds must be mobilized from source rocks. Hydrothermal activity!™? could 
facilitate this process. Cooking organic matter (metagenesis) in the core would 
crack it into smaller constituents, which may then be transported by fluid flow? 
as dissolved species, liquid droplets or entrained particulates (of particular rele- 
vance to the observations). A possible Earth analogue for this general scenario 
is Guaymas Basin in the Gulf of California’. This type of scenario has also been 
suggested for the interior of Pluto™. 

Is a scenario of processing primordial organic materials in Enceladus’ core 
consistent with the chemical and structural features of compounds detected in 
the plume? Before proceeding, it is important to emphasize that it is still an open 
question whether we are seeing features that are representative of bulk organic 
materials in the subsurface. Processes that could fractionate organic compounds 
from their hypothesized source region to our instruments include expulsion from 
the core, (bio)degradation, hydrophobic phase separation in the ocean, plume 
outgassing and impacts during high-speed flybys. In light of these considerations, 
it seems prudent to focus on broad chemical characteristics. 

Our data show the presence of unsaturated carbon and in particular the benzene 
ring. The former finding agrees qualitatively with elevated temperatures that pro- 
mote entropically driven (more product than reactant molecules) dehydrogenation 
reactions. Countering this effect might be high concentrations of H in hydro- 
thermal fluids’’; however, hydropyrolysis experiments demonstrate that aromatic 
carbon can coexist with abundant H,®. The outcome on Enceladus presumably 
depends on the conditions of temperature and H) concentration, as well as the 
duration of heating and the availability of hydrogenation catalysts such as nickel. If 
primitive organic matter in chondrites*’ and comets” can serve as a guide, it can be 
expected that Enceladus would have accreted organic materials containing benzene 
rings. From the perspective of a primordial origin, it therefore makes sense to find 
the benzene ring as part of the organic structures at Enceladus. It is unknown how 
much thermal maturation has occurred, but complete graphitization can be ruled 
out because the presence of organic-bound hydrogen is implied by the data. This 
imposes a limit on the thermal history of the organic source rocks, which seems 
consistent with the persistence of a low-density core rich in hydrated silicates”. 
These organics have not been overcooked. 

Two other compositional features are relevant to a general discussion. First, 
the likely presence of oxygen- and nitrogen-bearing functional groups is consist- 
ent with inheritance of these heteroatoms in accreted organic matter (O/C ~ 0.2, 
N/C = 0.04)**”° or their subsequent incorporation into organic structures by 
hydration or amination reactions (for example, of C=C or C=C) on Enceladus. 
The latter processes would be facilitated by the availability of liquid water and 
ammonia in the interior’*. Having heteroatoms also constrains the degree of ther- 
mal maturation. The second feature that we wish to highlight is the presence of 
methane in the plume”. It must be noted that there are multiple ways to explain 
the origin of methane”', but it is appealing to envision a common mechanism for 
the formation of the full spectrum of organic compounds found at Enceladus. The 
analogy to petroleum geochemistry on Earth implies that thermal processing of 
organic materials will inevitably produce some CH, accompanying more complex 
organics®. The exact quantity of thermal gas will depend on the nature of the 
organic source material and the environmental (for example, redox) conditions. 
At Guaymas Basin, hydrothermal fluids have very high concentrations of CH4 
(~60 mmol kg~')””. If the rocky core of Enceladus is also organic-rich and heated 
sufficiently, then this becomes a plausible scenario. 


Enceladus as the origin of HMOC parent molecules and exclusion of other 
potential sources. Photolysis in the E ring. Most individual ice grains reside in 
the E ring for months to a few decades before they collide (either with a moon or 
the main rings) or before they are completely eroded by plasma sputtering. The 
oldest populations are generally located further away from their origin, Enceladus, 
in the outer E ring’*”*. In principle, one has to consider that either HMOC par- 
ent molecules are not from Enceladus but evolved by photolysis from originally 
simple organics to the observed complex compounds in the E ring, or that the 
complex organics are from Enceladus but have been severely altered—for example, 
by dehydrogenation. However, several observations are not in agreement with 
these scenarios. 

¢ The CDA observed a larger proportion of HMOC-type ice grains in the 
(young) inner E ring than in the (old) outer E ring (see Methods, ‘Relative fre- 
quency of HMOC-type grains depends on impact speed and distance to Enceladus 
orbit’). This indicates degradation of the observed complex organics with time in 
the E ring, rather than their generation. 

¢ During Cassini’s E17 flyby, one freshly ejected HMOC-type ice 
grain was observed inside the plume (this was not included in the compositional 
analysis of this study, but it is shown in Extended Data Fig. 6). Furthermore, 
the CDA observed many HMOCs at locations close to Enceladus’ orbit, 
where most grains had been ejected from the plume only a few days to few months 
ago. 
‘ « The INMS observations were directly made in the plume with freshly ejected 
material. 

Instrument contamination. Intrinsic instrument contamination can be excluded 
as a source for the abundant organics in the spectra’’. Of concern are the large 
PAHs in Titan’s atmosphere, which may have deposited onto the surface of the 
CDAs impact target during close Titan flybys. Several pieces of evidence point 
against cross-contamination from Titan: 

« The first HMOC spectra were already detected in 2004 and early 2005, before 
Cassini's first close flyby of Titan. 

« The CDA has a decontamination device that was used between Titan flybys, 
which heats the CDA impact target to 100 °C for several hours. 

« No buildup of organics can be observed in spectra of E-ring grains over time. 

¢ The fact that only about 1% of impacts in the E ring show such a massive 
organic signature is not in agreement with a coating of organic contamination on 
the impact target. 

« The trend to observe HMOC-type grains at a higher frequency closer to 
Enceladus orbit than in the outer E ring (see ‘Relative frequency of HMOC-type 
grains depends on impact speed and distance to Enceladus orbit’) is not in agree- 
ment with a contamination origin. 

¢ Thousands of spectra from impacts of dust populations outside the E ring 
(stream particles, exogenous dust and main-ring dust) never show HMOC features. 
Contamination of INMS spectra from previous measurements is unlikely. The 
multiple environments encountered by the INMS require an evaluation of the 
potential for one environment to deposit refractory material onto the surfaces of 
the CSN antechamber and for that material to be released and measured at a later 
encounter. Of particular concern are the large PAHs in Titan’s atmosphere and 
Enceladus ubiquitous nanograins, which may fragment material deposited onto 
the antechamber surfaces, sputter the organic products, and thus release volatiles 
into the INMS. 

The strongest evidence that high-speed nanoparticles do not create false signals 
in the INMS spectra is that the amounts of PAHs encountered at Titan are insuf- 
ficient to produce the INMS measurements obtained during encounters such as 
E5. At Titan, both the INMS and the CAPS measure PAHs or their fragments, with 
benzene observed by both instruments. Waite et al.’° and Lavvas et al.’° analysed 
these measurements to estimate the abundance of PAHs as functions of altitude and 
of mass. Combining these estimates with the conservative assumption that every 
PAH molecule that enters the INMS CSN aperture during the Titan encounters 
is then released during E5, the resulting signal would be more than an order of 
magnitude below the level measured by the INMS at the fast Enceladus flyby (E5), 
as detailed in the following paragraph. 

The benzene density in Titan’s atmosphere is well modelled by an exponential 
with a scale height of 15-20 km and a density of approximately’® 10° molecules per 
cubic centimetre at 950 km. Before E5, there were thirteen Titan encounters with 
the INMS pointed to accept atmospheric neutrals, and approximately 10"! high- 
er-mass molecules entered the INMS CSN antechamber during those encounters. 
To produce the E5 measurements, the CSN antechamber maintained a density 
of less than 10” molecules per cubic centimetre for at least 40 s for the heavier 
species. With a residence time of only a few milliseconds, more than 10!” heavy 
molecules were required in the CSN antechamber during E5. The number of mol- 
ecules encountered at Titan is thus at least a factor of ten below the number of 
molecules required in the CSN antechamber to produce the higher-mass count 
rates observed during ES. 
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There is further evidence that the INMS spectra are unaffected by previous 

encounters: the CSN spectra at Enceladus do not change over many years, despite 
scores of Titan encounters during that period. Furthermore, there is no appar- 
ent dependency of Enceladus’ spectrum on whether a Titan encounter occurred 
between sequential Enceladus flybys. 
Laser dispersion analogue experiments for icy dust impacts. Extended Data 
Fig. 9 shows a sketch of the experimental setup in Heidelberg. A liquid water 
beam, in which the tested substances are dissolved, is injected into a high vacuum 
(5 x 10-5 mbar) through a quartz nozzle (with opening radius 6-8 1m). The 
quartz nozzle is mounted on a three-axis manipulator. A high-performance liquid- 
chromatography pump (model 300c, Gynkotek) is used to keep the flow rate 
constant at 0.17 ml min~!. The liquid beam flow is stable for ~3 mm downwards 
and then disintegrates into droplets. To maintain the high vacuum, the liquid pours 
into two liquid-nitrogen-cooled cryotraps. To perform the measurements, the 
water beam is hit by a pulsed infrared laser (Opolette HE 2731, OPOTEK; 20 Hz, 
7 ns pulse length), which operates at a wavelength of approximately 2,850 nm and 
a pulse energy of up to 4 mJ. The laser is directed and focused onto the liquid by 
three CaF, lenses and a gold mirror. The wavelength is chosen to specifically excite 
the OH stretch of the water molecules. When the laser energy is absorbed by the 
water beam, the water is rapidly heated and disperses explosively into fragments. 
In this way cations, anions, electrons and neutral molecules are created?”””. 

Depending on the operation mode, cations or anions that pass through a skim- 
mer are analysed in a reflectron TOF mass spectrometer (Stefan Kaesdorf). This 
mass spectrometer operates according to the principle of delayed extraction. The 
delay time is adjusted with the aid of a pulse generator (model DG535, Stanford 
Research Systems). By adjusting the delay time, ions with a distinct initial velocity 
are selected for detection by a microchannel plate detector. In combination with 
different delay times, variable laser intensities are used to simulate different impact 
velocities of ice grains onto the CDAs target (EK. et al. manuscript in prepara- 
tion). The detected signals are intensified by a preamplifier, visualized by a 12-bit 
digitizer card (Acqiris) and recorded using Lab VIEW. Flow injection of the tested 
solutions is accomplished with an injection valve (model MX9925, Rheodyne). 
500 single spectra are averaged to achieve the co-added spectra, as presented in 
Extended Data Figs. 7, 8. The chemicals (pro analysi) were purchased from Sigma 
Aldrich. All solutions were freshly prepared with doubly distilled HzO in lockable 
50-ml sample cups. 

Data availability. All CDA data used for this analysis are archived on PDS-SBN 
(http://sbn.psi.edu/archive/cocda). The exact time stamps of each data point are 
listed in Extended Data Table 1. 
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Extended Data Fig. 1 | See next page for caption. 
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Extended Data Fig. 1 | Semi-quantitative display of CDA HMOC 
spectra. a, b, Identified organic mass lines of individual HMOC spectra. 
The distribution of resolved mass lines and flank peaks of 64 HMOC 
spectra with the most distinct HMOCs are shown (see Methods, ‘Selection 
of 64 high-quality spectra for Fig. 1 and Extended Data Figs. 1 and 3’). 
19 more spectra with a high level of interference with water-cluster 

ions or low signal-to-noise ratio are not included here (see Extended 
Data Table 1). All peaks depicted here are also part of the data shown in 
Fig. 1b. The spectrum number (as defined in Extended Data Table 1) is 
indicated on the left as an identifier of the event. The extent is indicated 
by the horizontal length and the relative normalized amplitude of each 
spectral feature is given by the length in the vertical direction and the 
colour code (red being the highest and blue the lowest amplitude). The 
largest horizontal span of the symbol marks the peak maximum. In b, 
the amplitudes between 70 u and 85 u shown in grey indicate that they 
are not to scale with the symbols shown at higher masses (they would 

be much larger; see Fig. 1b for comparison). Spectra are sorted by their 
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impact speed, as estimated from the orbital elements of the impacting 
grain, with the highest speed (~15 km s~') at the top of the graphs and 
the lowest (~5 km s~!) at the bottom. Because the exact orbital elements 
are unknown, each impact speed has substantial intrinsic uncertainties, 
given in Extended Data Table 1. The 12 spectra for which a minimum 
impact speed could be derived from the presence of hydrogen mass 

lines (Extended Data Table 1; see Methods, ‘Selection of 64 high-quality 
spectra for Fig. 1 and Extended Data Figs. 1 and 3”) are placed at the top. 
The highest mass at which the recording of the CDA TOF spectrum ends 
varies between 174 u and 226 u (Methods, ‘Short description of CDA’s 
chemical analyser subsystem), as indicated by the grey horizontal bars. As 
a consequence, the frequency of the HMOC peaks around 178 uand 191 u 
in Fig. 1b is reduced because not all individual spectra cover this mass 
range. The absolute masses in each individual spectrum have an intrinsic 
uncertainty (absolute value) of +1 u at 80 u and +2 u at 180 u owing to 
the limited calibration accuracy of the CDA in this high-mass regime. The 
mass intervals between peaks, however, are accurate to the integer level. 
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accelerator facility*® with an impact speed of 5.2 km s~'. Above 100 uand 
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Extended Data Fig. 3 | Comparison of CDA HMOC spectra from fast 
and slow impacts. White and grey spectra represent the average of all 
spectra from impacts below and above ~10 km s~', respectively 

(see Methods, ‘Selection of 64 high-quality spectra for Fig. 1 and Extended 
Data Figs. 1 and 3’). All signatures with possible major contributions from 
inorganic species are colour-shaded as in Fig. 1. Signatures not marked are 
exclusively or mostly due to organic cations. The abundance and position 
of the HMOC species is relatively independent of the impact speed of the 
ice grain (see also Extended Data Fig. 1). By contrast, fast impacts induce 
stronger organic fragmentation signatures at masses below 70 u and 
HMOCs form more distinct, evenly spaced groups, characteristic of 
impact-induced dissociation processes. In turn, slow impacts show more 
abundant intact benzene-like cations. There seems to be a tendency of 
some organic cations to carry fewer H atoms at fast impacts (27 u, 51 uand 
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62 u), which is indicative of ‘softer’ ionization from the slower impact. In 
fast spectra, interference with water-cluster ions is less frequent than at 
lower speeds. In contrast to fast spectra, fragmentation below CH; (15 u) 
is usually not observed in slow spectra. Spectra from slow impacts are 
prone to abundant water clustering, creating mass lines of the form 
H*(H,0),, with n = 1-4, at 19 u, 37 u, 55 wand 73 u (blue). In fast spectra, 
clustering is limited and only the mass lines at 19 u and 37 u are generally 
present; occasionally, formation of the smaller water ions OH* (17 u) and 
H,O* (18 u) is observed. Similarly, Rh* (103 u) forms from excavation of 
the impact target only at fast impacts and interferes with HMOC species 
there. To a lesser extent, this is also true for the rhodium-—water cluster 
Rh*(H,0) at 121 u. See the individual CDA spectra in Extended Data 
Fig. 4 for comparison. a.u., arbitrary units; ToF, time of flight; lg, log. 
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Extended Data Fig. 4 | See next page for caption. 
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Extended Data Fig. 4 | Example CDA spectra from individual HMOC- 
type ice grains. In these individual spectra, the peak definition is naturally 
higher than in the co-added spectra shown in Fig. 1 and Extended Data 
Fig. 3, and therefore some of the spectral features collected in Fig. 1b 
become more apparent. a, HMOC spectrum from one of the fastest 
recorded impacts (12-18 km s~'). The appearance of hydrogen cations 
(Ht, Hy and H}) at 1 u, 2 uand 3 u, as well as the disintegration of the 
CH? ion into CH}, CH* and Ct (12 u-15 u) and the formation of H,Ot 
(18 u), are evidence of the high-speed impact. The abundance of 
unsaturated small cations below 70 u, probably fragments from aromatic 
structures, is increased compared to slower spectra. The frequently 
occurring mass line at 45 u (Fig. 1b and Extended Data Fig. 1) is 
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noticeable; it cannot originate from pure hydrocarbons and requires 
heteroatoms, probably oxygen in this case. While a 45 u feature is quite 
common in our HMOC dataset, the peak at 86 u is only apparent in this 
spectrum. b, HMOC spectrum from a grain detected at intermediate speed 
(5-8 km s~!). High-mass fragments and benzene species are abundant 
whereas further fragmentation of the benzene ring into C; and Cy species 
is less apparent compared to high-velocity impacts (a). We note that 
organic cations with 2, 3, 4 and 5 C atoms show a tendency to carry more 
H atoms compared with the high-speed impact, which is indicative of 
‘softer’ ionization from the slower impact. Organic fragmentation below 
CH} is usually not observed in this speed regime. 
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Extended Data Fig. 5 | See next page for caption. 
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Extended Data Fig. 5 | Example HMOC CDA mass spectrum with 
extended mass range and statistics for all features. a, Ice grain spectrum 
showing the HMOC event with the strongest extended mass range signal 
of the dataset. The dashed line at 6.4 1s divides the spectrum into the 
high-resolution part (10 ns sampling) and the low-resolution part (100 ns 
sampling) (see Methods, ‘Short description of CDA’s chemical analyser 
subsystem’). There are several relatively narrow peaks between 250 u and 
500 u and two much more extended features peaking at about 1,000 u and 
1,800 u. In this case, the cations with mass in excess of 200 u are more than 
twice as abundant (defined by the area under the curve) as those below 
200 u. We note the logarithmically scaled TOF axis in this case. These 
features are usually less frequent and less pronounced than in the extreme 
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case shown here. The extended spectrum frequently shows an instrument- 
artefact peak at 6.8 js, which was not considered in our analysis. 

b, Histogram showing the frequency of occurrences of the features 
observed in the extended mass range. The definition, and thus 
significance, of peaks in the extended spectrum is generally lower than in 
the nominal spectrum. In particular, features above 500 u are sometimes 
ambiguous and their interpretation should be taken with great caution. 
However, the statistics shows three preferred mass regions: 200 u-500 u 
with decreasing frequency, around 1,000 u and around 1,700 u. Even if 
no sizeable peaks are present, the cation signal in the HMOC spectra is 
generally higher than the noise level when the low-resolution recording 
starts and typically only decays to noise level at around 500 u or later. 
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Extended Data Fig. 6 | CDA HMOC spectrum recorded in the environment of the plume, the instrument settings had to be modified in 
Enceladean plume. During Cassini’s E17 flyby of Enceladus’ south pole at —_ a way that compromised the spectrum quality (lower sensitivity and lower 
75 km altitude, where the CDA recorded about 40 plume spectra with its mass resolution; see Extended Data Fig. 4 for comparison). The spectrum 


full mass range, one spectrum was of the HMOC type. This isin agreement _ is baseline-corrected. On the only occasion when the CDA recorded a 
with the proportion of this particle type being a few per cent in the plume large number of spectra with high cadence directly in the plume during 
and in the E ring close to Enceladus (see Methods, ‘Relative frequency of Cassini's E5 flyby’, the spectral range was truncated below about 100 u 
HMOC-type grains depends on impact speed and distance to Enceladus to allow for a higher data rate. This unfortunately did not allow the 
orbit). The flyby speed determined the impact speed of 8.6kms~' andthe _ identification of the defining HMOC signatures. 

particle had a radius of about 2 jm. To operate the CDA in the dense dust 
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Extended Data Fig. 7 | Laser ionization mass spectrum of benzoic acid 
and benzyl alcohol dissolved in water. Analogue TOF mass spectrum 
recorded with the liquid microbeam ionization setup (see Methods, ‘Laser 
dispersion analogue experiments for icy dust impacts’ and Extended Data 
Fig. 9) to simulate the formation of tropylium and benzene cations and 
their fragmentation ions at impact speeds’® of the order of 10 km s~'. The 
concentrations of benzoic acid and benzyl alcohol are 3 g1~! and 0.2 g1-}, 
respectively. Water ions are marked in blue, aromatic ions and ions 

from aromatic fragmentation are marked in orange and mixed organic- 
water species are yellow. To yield both benzene cations (77 u-79 u) and 
tropylium ions (91 u), two different aromatic structures are required 

(Fig. 2). The predominant aromatic fragments of benzoic acid are at 77 u 
and 79 u, whereas benzyl alcohol almost exclusively forms tropylium ions 
at 91 u. The peak at 95 u is a water cluster of the phenyl cation, which is 
much more pronounced than in the HMOC spectra. Although the strong 
phenyl-water cluster signature here illustrates the intimate mixing of 
organics with water, the much lower 95 u signature in HMOC spectra 
argues for less efficient mixing of organics with water there, probably due 
to a core-shell structure that physically separates organics from ice in the 


grain. Cations from the fragmented ring can be seen at 39 u, 51 u-53 u 
and 63 u-65 u and agree with the CDA observations (Fig. 1b). In contrast 
to the CDA spectra, however, saturated C3 fragments (41 u—43 u) are 
depleted, and C2 (27 u-29 u) and C, (15 u) fragment cations are entirely 
absent, confirming the presence of an abundance of aliphatic cations in 
HMOC grains. The ratios of benzene and tropylium ions and the water 
ions match the HMOC spectra well. The total concentration of organic 
species used here (~0.32% by weight) can be used to estimate a lower 
limit for the concentration of organics in CDA HMOC grains for two 
reasons. First, in the analogue experiment we selected substances that 
most efficiently yield the desired aromatic species and other, less efficient 
precursors would yield even lower signals at 77 u and 91 u. Second, to 
account for both the low- and high-mass fragments between 100 u and 
2,000 u, which are absent in the laboratory spectrum, additional organic 
substances or larger molecules would be needed to further increase the 
organic concentration. Therefore, the concentration in Enceladean HMOC 
ice grains in many cases can be estimated to be near or even above the per 
cent level. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Matrix 


Pyrene molecular 
0,03 ———— 


Pyrene 
fragment _ s 
0,02 - B 
5 i 
ca =  Pyrene 
2 J 
OD fragments 
Cc 
o e 
f= /\ Pyrene fragment 
Bo g s 
0,01 = a bre 
st IN 
| = SN 9 8 


70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 
Mass [u] 


Extended Data Fig. 8 | Laser ionization analogue spectrum of pyrenein _ blank experiment with just the solvent mixture and are either direct pyrene 


water-acetic acid mixture. Cationic TOF mass spectrum recorded with fragments or cations formed from pyrene fragments clustering with the 
the liquid microbeam ionization setup (see Methods, ‘Laser dispersion solvent (for example, at 159 u). The molecular mass lines of pyrene are 
analogue experiments for icy dust impacts, and Extended Data Fig. 9) about 10 times more abundant than those of any fragments, indicating 
containing 0.1% pyrene dissolved in a mixture of water and acetic acid. the stability of the PAH molecule. In contrast to CDA HMOC spectra, no 
The laser pulse simulates an ice grain impact with a speed” of about isolated benzene ring fragment (77 u or 91 u) forms. 


10 km s"!. Features marked as ‘pyrene fragment’ do not appear in the 
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Extended Data Fig. 9 | Laboratory setup used to simulate ice grain impacts onto space-borne impact ionization detectors. See Methods, ‘Laser 
dispersion analogue experiments for icy dust impacts, for a detailed description of the setup. 
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Extended Data Fig. 10 | Co-added INMS ice grain spike spectrum from 28 u signal. CO, and C2H, collectively contribute less than ~10% of the 


50 


three plume encounters. See Methods section ‘INMS ice grain spectrum’ 28 u signal. CO (blue circles) is required to fit the rest of the 28 u signal 
for details on how the spectrum was composed and analysed. Error bars and the entire 29 u signal and matches its other dissociative peaks (C at 
(1 s.d.) are derived from the dispersion of the count rate from the three 12 uand CO** at 14 u) well. The spectrum also indicates the presence of 
separate measurements of the individual encounters. The spectrum nitrogen-bearing species: the ‘stair step’ pattern around 41 u matches best 


suggests the presence of CO fragments (blue circles) as an oxygen-bearing to the C,H3N spectrum (red circles). 
species. Nz has very low abundance and contributes less than ~10% of the 
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Extended Data Fig. 11 | INMS spectra used to produce the differenced (black) spectrum, which is normalized to match the 15 u signal of the 
spectrum in Fig. 3. a, b, The individual spectra for fast (E5; a) and slow slow spectrum (grey). The residual (difference) between the two spectra is 
(b) flybys are shown. c, The spectra of a and b are plotted with the E5 plotted in Fig. 3. 
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Extended Data Fig. 12 | Schematic on the formation of organic aerosols made of insoluble organic material that later serve as efficient 
condensation cores from a refractory organic film. a, Ascending condensation cores for the production of an icy crust from water vapour, 
gas bubbles in the ocean” efficiently transport organic material” into thereby forming HMOC-type particles. In parallel, larger, pure salt-water 
water-filled cracks in the south polar ice crust. b, Organics ultimately droplets form (blue), which freeze and are later detected by the CDA as 
concentrate in a thin organic layer (orange) on top of the water table, salt-rich type-3 ice particles in the plume®”. 


located inside the icy vents. When gas bubbles burst, they form 
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Extended Data Table 1 | List of 83 CDA mass spectra identified as of HMOC type 


Pp # | ure sc |e) ROR) {ZUR | Vine (| Vo | Vow | Radius (um) 
|___1| 2004-302/21:31:38 | 1477691821] 1.34e-14|_ 6 | 0.22] 6 4] 47] Oo] 3 || 
|___3 | 2005-068/19:12:50 | 4googe367{_—2.30e-14| 6.24] co.nz | 4 2] 06] og | || 
|__4| 2005-068/19:57:18 | 4g9091035[ 5296-14 oi | 02 | sat 62] ioe] ov] oat 17] 
|___5 | 2005-177/o9:15:19 | 1498470176 | 1.02e-13{ 68-44 se*| 64] i09t o7] oat 19] 
|___6| 2005-177/09:16:46 S| 1498470263 | 4. 43e-14| 5.67 | 2.44] | 64] 10.9] oo | 3 | | 
|___8 | 2005-177/10:15:39 | 1498473796 | 2. 55e-14| 52-3] a7] 64] 0} oo | 2 || 
|___9| 2005-177/10:19:23 | agga74o2z0[ 1.32e-14[ ig] ait s7{ 64] sof os] o2t 14] 
|___10 | 2005-177/10:23:56 | 1498474203 1.62e-14[ 5.14] -tos| a7] 64] tot os ot 14 | 
|__11| 2005-177/11:24:38 S| 1498477935 | 2.tze-14|_ 469 | 0.74] 8G] 6 4] 109] oo | || 
|___12 | 2005-177/11:36:22 | 149ga7ae39[ so 2.gse-14{ 4.61 0.67] sot 64] 09 og | zt | 
|___13 | 2005-177/11:46:21 | 1498479238] 4or-14] 4 sa] 062] a6] 64] 09] os zt | 
|___14| 2005-177/12:00:40 | 4gg4goo97{_5.02e-14 [| 4 as | 053] pert 63] ost oe] oat 17] 
|___15 | 2005-177/12:22:55 | 149g4gi432{5.43e-14{ | 04 ast] 63] 07] 06 | 3 | 7] 
|___16 | 2005-177/12:24:48 | saggagisas[254e-14[ 4.29] 0.39] ss*[ 63] 07t os] o2t 15] 
|___17 | 2005-177/12:31:51 | t4ggagig6s | 3.266-14[ 425-035] ast 63] 107] og | 3 | 16 | 
|___19 | 2005-177/12:49:08 | ag9g4g3005| aaze-i4] gis] -o.24] g4{ 62] o6{ ost 03] 47] 
|___ 20 | 2005-177/12:50:33 | t49sag3090[ a.e2e-14] aay -o2at eat 62) io6f ost o2t 15 | 
|___21 | 2005-177/12:56:32, | taggagzaag[5.276-14[ ati | o2t a3t 62] ost o7] 03] i8] 
|___22| 2005-177/15:48:06 | 1498493743 2.20e-14{ 3.61] ost 6 9 | 7] 7] 319 | 
|___23 | 2005-177/16:47:37_ | 49gao73i4[3.55e-13[ 3.69] ait 66f 49 f a3 at ost 34] 
|___24 | 2005-267/03:21:33 | 506224999 340-14 [5.24] oot i | | aso] zt 15] 
|___25 | 2005-267/03:22:22, | 1506225048] 924-14] 5.25] ot 9.2 | 8] as | 6] 3] 47 | 
|___26 | 2005-267/03:35:30 | 506225836 2.75e-14{ 5.37[ oon zt ost tot ost o2t 14] 
|___27 | 2005-267/07:34:31 S| 1506240177 | __2.94e-14|_ 752] ot S| a2] 6 | 2 | 5 | 
|___28 | 2005-303/04:28:34 |] 1509339440[ 3. 74e-14[ 5.76] o.04 4 [sti o9T oat 23] 
|___29 | 2005-303/04:43:14_ | 1509340320[ Ss a75e-14{ 5.85] o.04 S| |] ost oat | 
|___30 | 2005-303/o4:a804_ | 509340610 [ 4. aze-a4[ 5.88] oo4t oe t 4st a3 oot oat 23 | 
|___31 | 2005-303/05:03:27, | 1509341533] agae-14] 5.98] 04 oo | oat oot oat 23 | 
|___32 | 2005-303/05:43:23 | = 1509343929| 4. 32E-14| 6.24] 0.04 no | | 6] | | 22] 
|___33 | 2005-303/05:53:09 |] 509344515[ 5.82e-13{ 6 3] oa fit aot a2t ost 32] 
|___34 | 2005-303/05:59:07_ | 1509344873| 7.83e-14| 6.34] 04 6 | | 7] og oat 2 
|___ 35 | 2005-303/06:36:17_ | 1509347103[ 3.31e-13[ 6 59] o.04 of 5.2 ast ait oat 29] 
|___ 36 | 2005-303/06:41:04 | 1509347390| 8. 32e-14{ 62] 04 70f 2] ast oot oat 23 | 
|___37 | 2005-330/22:05:55 | 514735696 | 2.396-14| 8.97 | -0.06] 73] 5.4] 92} oz} | 8 | 
|___ 38 | 2005-358/14:37:41 | 5i4i2sois{ 6. 68e-14{ 2] 0.03] 68] 5.0] a6 og oat 23 | 
|__39 | 2005-358/14:49:12, |S 1514128709| 432-13] 6.13] __—-0.03| 68] 5.0] 85) 12] o5| 3.1] 
|___40 | 2005-358/15:57:10 | 1514132787[ 2.07e-14{ 5.71 -o.02t 64] 47] it ost oat 2a] 
|___41 | 2005-358/15:57:39 |] 1514132816 | 4. 30E-14{| 7 | 0.02] 6 4 79a t 24 
|___42 | 2005-358/16:09:40 | 1514133537 7.77e-5| 5.63] -0.02] 6 3] 47] oto] 38 | 
|___44 | 2005-359/04:06:40 | 1514176557{_ 2.87e-14{ 6.19] 04 sof ot ost oat 2a] 
|___45 | 2005-359/04:06:52, | 1514176569| 960-14] _ 6.19] 0.04] 6 | 5] 6} | | 5 | 
|___46 | 2005-359/05:10:40 |] 514180397{ 5.38e-14[ 62] 04 of zt ast ost oat 22] 
|___47 | 2005-359/05:20:58 S|] 1514181015 | 4. 33e-14| 6.69 | 05 | 70f 2] t | 
|___48 | 2005-359/06:01:55 |] 514183472[ 2.age-14[ 98] 05 if 53f oot ot oat 19] 
|___ 49 | 2005-359/06:26:12 | t514isag29 | 7.a4e-i4] 7.5] 05] zt 53] of ost oa] 22 | 
|__50 | 2005-359/06:32:15 |] 514195292 [ 1.35e-13[ 7.19] os 72t 5 3f oot gt oat 2a | 
|___51| 2006-016/os:o8:18 |] 5609168 { 5.756-14{ 12.91 -o.08| 6 S| 42 og| oat 24 
|___52| 2006-056/05:15:38 | 1519537530] _—.26e-14|_ 6.46] _—-0.03] 54] 40; 68) 09] 04] 24] 
|___53| 2006-056/17:46:16 | i5i9se2s69[ t.65e-14{ 82] 04 7 ft 2 oot oat 23] 
|___54| 2006-056/18:48:22, |S i5i95a6295| 9. g6e-5| 7.15]  o.04 | 6 of | 5s} o3t | 
|__55| 2006-056/22:14:58 | asagsogeoi | 132e-14] aos | S| at aif oz] oa] 19 | 
|___56 | 2006-057/02:02:07_ | 1519612320| 6. 24e-14{ g.sa| ov] 6 | goat 2] 
|___57| 2006-057/02:05:38 |] 519612531 3.55e-14{ 9. 9 | cov | sf of tos ost 22] 
|__59 | 2006-080/19:20:46 S| 1521661852 | _—.79e-14| 13.1] —o.09| og S| | 2] os] 3] | 
|___60 | 2006-337/o0:41:01 | 1543799608 [ s.t7e-t4{ 4.9904 az6f  o3f ssf oat o2t i 
|___62| 2006-337/00:55:19 | 543800466 [ a77e-14{ 5.04] o.zt af a3 f it oat o2t it] 
|___63 | 2006-337/00:59:05 | 1543800692 | 1.55e-14{ 5.05] o.25] 125| 92] iss] 03] ot 09] 
|___64 | 2006-337/01:18:39 | 1543801866 | _—1.86e-14| 5.13] 0.48 | 124] 9.2] 156] 4 | | | 
|___65 | 2006-337/01:26:38 |] 1543802345[ 3.85e-14{ 5.16] sz] i24*{ ait is6t o4to2t a] 
|___66 | 2006-337/04:59:37_ | 1543815124[ 1.57e-13| 6.24] 2.93] 106*| 7.8] 133) 06 | 2 | 5 | 
|___67 | 2006-337/05:49:38 | a543gigi2s|a7se-14] 54] 3.43] yoif 75] iz8{ o4t 2] 12] 
|___68 | 2006-349/04:52:09 | 1544851483[ a g6e-14| 7.89] -2.66] 89] 65] 112) 06] o.2| 164 
|___69| 2006-349/16:06:48 | i54aggi962[ 3.95e-14[ to.08 | 26] a 3[ of io5] oe] o3f 17] 
|___70 | 2007-130/19:21:23 |] a5575ag1i7{ 8. 3e-14{ a] ost 103*| 76] 130f os] o2t 15 
|__71| 2007-130/19:32:19 | 1557518773| ___—5.98-13| 4.75] _—-0.02| 10.3*| 7.6] 130) 07] 03] 19] 
|___72 | 2007-130/19:37:15 | 557519069 [4 t0e-14{ 4.73] -o.06] 103/76] 130] o5]  o2f 13] 
|___75 | 2007-130/19:51:50_ | 1557519944[ 1.4ge-13{ 467-02] 103; 76] 130) 06] oz | | 
|___77 | 2008-130/23:08:03 | 1589067930| it. t6e-13[ 4.67] oe | 143*| 106; iso  o4] o2f 1] 
|___78 | 2008-130/23:57:13 | s9070sgo| __—2.02e-14|_ 4 | | i4e*| 108] 185] 03] 01] 08] 
|___79 | 2008-131/00:20:35 | 589072282 a.zge-14] gaa] 0.21] i4.7*[ 109] is6{  o3f o1f 0.9 | 
|___80| 2008-131/02:18:04 | 1589079331 | 5.10E-14 | 3] -a75 | 14a*| 104] 178) 03] oi] 0.9 | 
|___81 | 2008-131/03:00:00 | issgogisa7{ es s.66e-14{ 432 2.26] 135/100; 170] o4]  o2{ 14 
|___82 | 2008-131/03:16:31 | 589082838 {1.34e-13{ 4.35 [2.46] 13.2[ 98st 167] 04 | 0.2 i 

[83 | 2008-131/03:19:14 |] 589083001 6 436-14 435-2497 13.2*[  97[ 166] o4To2f a] 


The columns of the table show the number of the spectrum (#, in order of detection); the time of detection in coordinated universal time (UTC) and as measured by the spacecraft clock (SCLK); the 
amplitude of charge measured at the CDA ion grid (QI (C)); the spacecraft’s distance to Saturn’s rotation axis (R(Rs); where Rs = 60,268 km is Saturn’s radius); the spacecraft’s distance to the ring 
plane (Z(Rs)); grain impact speeds (in kilometres per second, see below); and the corresponding grain radii estimates (in micrometres). For the grain impact speeds, the velocity Vimp is derived assuming 
that grains are in circular Keplerian orbits, and V_, and V,,, are the +1o of the grain impact speed distributions, inferred from an E-ring model derived from numerical simulations’274. 

The grain radii shown in the last three columns are calculated on the basis of Vimp, V-, and V,, using equation (3.37) of ref. 78 

Events for which the UTC time is shown in bold fonts are the 64 high-quality spectra (see Methods, ‘Selection of 64 high-quality spectra for Fig. 1 and Extended Data Figs. 1 and 3’). Speeds marked 
with an asterisk denote spectra in which one or more hydrogen cations (H*, H2*, H3*) are present, which is used as an independent and reliable criterion for impact speeds larger than 10 km s~! even 
if the number derived from the assumption of circular orbits (Vimp) is lower. 
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Flying couplers above spinning resonators generate 


irreversible refraction 


Shai Maayani)’, Raphael Dahan’, Yuri Kligerman!, Eduard Moses!, Absar U. Hassan3, Hui Jing*, Franco Nori>°, 


Demetrios N. Christodoulides? & Tal Carmon"* 


Creating optical components that allow light to propagate in only 
one direction—that is, that allow non-reciprocal propagation 
or ‘isolation’ of light—is important for a range of applications. 
Non-reciprocal propagation of sound can be achieved simply by 
using mechanical components that spin!”. Spinning also affects 
de Broglie waves’, so a similar idea could be applied in optics. 
However, the extreme rotation rates that would be required, owing 
to light travelling much faster than sound, lead to unwanted 
wobbling. This wobbling makes it difficult to maintain the 
separation between the spinning devices and the couplers to 
within tolerance ranges of several nanometres, which is essential 
for critical coupling*®. Consequently, previous applications of 
optical” and optomechanical!*”-”° isolation have used alternative 
methods. In hard-drive technology, the magnetic read heads of a 
hard-disk drive fly aerodynamically above the rapidly rotating 
disk with nanometre precision, separated by a thin film of air 
with near-zero drag that acts as a lubrication layer”!. Inspired by 
this, here we report the fabrication of photonic couplers (tapered 
fibres that couple light into the resonators) that similarly fly above 
spherical resonators with a separation of only a few nanometres. 
The resonators spin fast enough to split their counter-circulating 
optical modes, making the fibre coupler transparent from one side 
while simultaneously opaque from the other—that is, generating 
irreversible transmission. Our setup provides 99.6 per cent isolation 
of light in standard telecommunication fibres, of the type used for 
fibre-based quantum interconnects”. Unlike flat geometries, such as 
between a magnetic head and spinning disk, the saddle-like, convex 
geometry of the fibre and sphere in our setup makes it relatively easy 
to bring the two closer together, which could enable surface-science 
studies at nanometre-scale separations. 

The real part of the refractive index of a material describes the speed 
of light in that material on the basis of linear light-matter interaction. 
An irreversible refractive index”? was originally demonstrated by 
Fizeau in flowing water, in an experiment of paramount importance 
for the development of special relativity. Such an irreversible index has 
generally been considered in photonics only when the entire apparatus, 
including its detector and emitter, is moving (such as in gyroscopes). 
Controlling photonic structures in such a way that they exhibit an irre- 
versible index is challenging, as is controlling rapidly moving dielectrics 
while self-positioning them with nanometre precision. Therefore, cur- 
rent photonic technology and fundamental studies that involve light 
generally use stationary, non-self-positioned dielectrics and rely on a 
reversible index that treats counter-propagating light equally. Here we 
report a photonic device containing a dielectric material that moves 
sufficiently fast, while its position is self-adjusted finely enough, 
to achieve an irreversible index, which either passes or blocks light 
depending on the direction it comes from. 

We fabricate a spherical resonator by melting the end of a sili- 
ca-glass cylinder, flame-polishing it and then mounting the res- 
onator on a turbine (Fig. 1d, Methods). In our experimental setup 


(Fig. 1), this rotating silica sphere, which is resonant when its cir- 
cumference corresponds to an integer number of optical wavelengths, 
is positioned near a tapered region of a standard single-mode tele- 
communications fibre. This tapered region is used to coupling light 
evanescently*® into the rotating resonator. In the same manner, light 
is coupled out through the other side of the fibre via the same coupler. 
Each side of the fibre thus serves simultaneously as both an input port 
and an output port. Depending on the input port, light is coupled to 
circulate in the sphere in either the clockwise or the anticlockwise 
direction. Because of the Fizeau drag, the refractive index that light 
experiences in the spinning dielectric sphere is different in these 
counter-circulating directions. As a result, light entering from one 
side of the fibre is on resonance and is critically absorbed**, whereas 
light entering from the other side is off resonance and so is almost 
100% transmitted. 

Central to the realization of such an irreversible index are the aer- 
odynamic processes at play when the nanoscale fibre is only a small 
distance above the rapidly spinning spherical resonator. Numerical cal- 
culations (Methods; Fig. 2d) demonstrate that, for a sphere spinning 
at an angular velocity §2, a boundary layer of air forms that is dragged 
past the stationary taper. By dragging the air into the region between 
the taper and the sphere, the moving sphere causes the taper to fly at 
a height h above the sphere, owing to the air pressure on the surface 
of the taper that faces the sphere (the ‘air bearing’ surface). If any per- 
turbation causes the taper to rise higher than this stable-equilibrium 
height, it floats back to its original position; we refer to this behaviour 
as ‘self-adjustment. 

As shown in Methods, the tension of the taper, T, can be estimated 
analytically from 


T=6.19pR°72 f (h— VP = +r) 3/dx (1) 
0 


where J is the viscosity of air, r is the radius of the taper and R is the 
radius of the sphere. When operating at tensions near 3 GPa”, we find 
h=320nm and h=38.2nm for the cases shown in Figs. 2 and 3, respec- 
tively. The self-adjustment of the taper separation from the rotating 
sphere enables critical coupling*® of light into the sphere, whereby 
counter-circulating light experiences optical drag identical in size, but 
opposite in sign. 

To understand how Fizeau drag leads to isolation, we first take 
into account the relativistic addition of velocities when the periph- 
ery of the rotating silica resonator is moving towards or away from 
the input/output ports where the light coupling takes place (Fig. 1). 
On the basis of these considerations, we conclude that, in the lab- 
oratory frame, the refractive indices associated with the clock- 
wise (‘+’) and anticlockwise (‘“—’) whispering-gallery modes are 
n4=n[l+£nv(n~? — 1)/c], where n is the refractive index of silica, 
c is the speed of light in vacuum, v= RI? is the tangential velocity 
of the rotating optical cavity and R is the radius of the resonator. 
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Fig. 1 | Experimental setup. Spinning the silica sphere (the resonator) 
results in different refractive indices for the counter-circulating 
whispering-gallery modes of the resonator. a, b, Illustration of the left- 
propagating ‘pass’ configuration (a) and the right-propagating ‘block 
configuration, where the input light does not match the resonance 
frequency of the cavity (b). c, The setup enables us to perform 
experiments in which light enters from both ports simultaneously. Solid 
(dashed) arrows represent the path of the transmitted (blocked) light. 


a c 
ie ae. | ee 
< [ j 
- : | ga; = 
5 i o6f €6E i = 
3 607 . 4 i 5 
a \ iii : 
q : i 8 
2 4 7 ili 8 
© = zz Z = 
= 20+ \ y ; 
0 L 
0) 100 200 300 
b Frequency (MHz) d 
10 
100 en cesen eweses ese sarezwncer Essex 
= Left, pass = 
Z so” ‘ an 
c E 
fe] = 
g ™ 3 
Ee oO 
5 40 2. 
E 20 3 
FE ey 
i € 
Right, block ee = 
O nn a ee 
-20 -10 0 10 20 
Frequency (MHz) 0 


Fig. 2 | Experimentally measured isolation of 99.6%. a, b, Optical 
transmission of light coming simultaneously from the left (red) and right 
(blue) while scanning the laser frequency. The frequency range shown in 
b corresponds to the region shaded green in a, with 0 MHz corresponding 
to the centre of this region. The error bars show the standard deviation. 

c, Experimentally measured isolation (circles) and theoretical fit (line; 

see Methods); the green shading shows the confidence band for the fit. 
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In the blocked path, the light is absorbed in the resonator and does 

not reach the detector. d, A micrograph of our experimental setup; the 
inset shows the calculated transverse field of the optical resonance. 
Optical polarization in our scheme is either parallel or perpendicular to 
the rotation axis of the sphere, compatible with the polarization of the 
transverse electric or the transverse magnetic mode of the sphere. This 
polarization does not affect the performance when we use our polarized 
light source. 
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80 
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Frequency (MHz) 


Fibre diameter 


Surface of rotating sphere 


The curve peaks at a maximum isolation of 99.6% + 1.6%. d, The airflow 
near the fibre (arrows represent calculated direction; colour represents 
calculated speed) pushes the taper (empty circle) away from the interface 
of the rotating sphere (grey surface at the bottom). For this experiment, 
the optical wavelength is \= 1.55 1m, the radius of the resonator is 
R=4.75mm and its rotation frequency is Q=27 x3,000 rad sol 
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The difference in resonance frequency between the counter-circulating 
modes is then?° 


Ao 90 (2) 


where Weg is the optical resonance frequency fora stationary resonator 
and \ is the optical wavelength in vacuum. Scattering between counter- 
circulating modes” is eliminated in our experiments to prevent 
unwanted reflections (see Methods). In addition, the coupling is set to 
critical*> to prevent transmission in the wrong direction. Under these 
conditions, and taking into account the frequency shift from equation 
(2), the rate equations?’ for the counter-circulating modes, with field 
amplitudes a. and a,, are 


2 2 
Gg T |r (W—-Wrest + nQ) 45>! ap Aout = ; = Gin + ita, 
T T 2 
¢ be es t? |) — “ 
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where w is the optical frequency of the input light, t is the real amplitude 
coefficient of transmittance between the fibre and the resonator, T is 
the circulation time for the mode travelling inside the sphere, a;, (aj, 
is the optical input from the left (right) and @,, (@oy,) is the optical 
output from the right (left). Accordingly, the optical transmission for 
the opposing directions are 


_ a 4 
[=| | Baad 2 
Gin +r (w Wrest ae nQ2) 
ea (4) 
ay a 
T = 1 4 2 2 
Gin P+T (w ~~ Wrest — 72) 


Equation (4) clearly indicates that the transmission in this rotating 
resonator system is non-reciprocal: the equations are identical, except 
for the sign of 7, which facilitates isolation by resonantly absorbing light 
entering from one side, while off-resonantly transmitting the coun- 
ter-propagating light. Importantly, the equations in equation (3) are 
uncoupled because a,,,, is not a function of a;,, and @,,, is not a func- 
tion of a,,, which allows crosstalk-free operation while light enters 
simultaneously from both sides. 

We measured the spectral transmission as a function of the input 
direction (Fig. 2) and as a function of rotation speed (Fig. 3). In the 
first experiment, we demonstrate that our isolator almost completely 
transmits light coming from one side of the fibre while blocking light 
coming from the other side. As expected from equation (4), tuning 
the laser frequency w through the counter-circulating resonance fre- 
quencies, Wrest — 72 and Wrest + 72, reveals oppositely drifted dips in 
transmission in for the two opposing directions (Fig. 2a). The split 
between these counter-circulating modes is larger than their widths, 
which enables light from one side to pass through while the other side 
is blocked (Fig. 2b). Isolation is most challenging when light enters 
from both sides of the isolator at the same time, because the isolator 
might suffer from both imperfect transmission of one beam and backs- 
cattering of the other. 

To measure the degree of isolation, we subtract the power of light 
in the blocking direction (ideally 0%; red, Fig. 2b) from the power of 
light in the transmitting direction (ideally 100%; blue, Fig. 2b), while 
injecting light to the isolator from both sides (Fig. 1c). As can be seen in 
Fig. 2c, we measure a remarkable 99.6% intra-fibre isolation (Methods), 
despite the challenge of simultaneous two-port input. 

In the second experiment, we measure the frequency shift as a func- 
tion of the mechanical rotation rate (Fig. 3). Starting from rest ((2=0) 
as a control experiment, the counter-circulating modes overlap, owing 
to their expected degeneracy (Fig. 3b). As predicted by equation (2), 
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Fig. 3 | The Fizeau shift. a, The Fizeau shift is evident from the split 
between the counter-circulating resonances as a function of the rotation 
speed of the resonator. Circles describe exprimental results and solid 
lines represent linear fits. b, c, Transmission spectrum in a control 
experiment while the resonator is at rest (b; corresponding to the green 
shaded point in a) and while the resonator is spinning at 6.6 kHz (c; purple 
shaded point in a). The experimental setup is shown in Fig. 1. The radius of 
the sphere is R = 1.1 mm. Red (blue) data points are for transmission of light 
coming from the right (left). The error bars are one standard deviation. 


increasing the mechanical rotation frequency 2 results in a linear 
opposing frequency shift of {2 (Fig. 3a) for the counter-circulating 
modes. 

In conclusion, we have demonstrated optical non-reciproc- 
ity experimentally by breaking time-reversal symmetry through 
a mechanically spinning optical resonator. In addition, enabling 
intra-fibre isolation could be beneficial for applications such as fibre- 
linked quantum photon routings”*. We believe that our work can be 
extended to spinning microphotonics, with similar nanometre-scale 
flying heights. When approaching couple-resonator separations of 
300 fm8 (a regime related to femtotechnology), repulsive van der 
Waals forces are predicted to start acting against the large Casimir 
attraction”®. Being highly convex, spheres might be ideal for test- 
ing exceptionally small separation distances, near the boundary 
between the compliant surfaces. Furthermore, with the availability 
of quadrant photodiodes for highly accurate measurements in opti- 
cal tweezers, tweezed counter-rotating spheres, one flying the other, 
might allow tests of gravity at distances shorter than what the current 
state-of-the-art permits. Although challenging, miniaturization and 
faster spinning of our resonators might be possible with continual 
technological improvements in the rotation” and manipulation”? of 
spherical dielectrics. 
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Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0245-5. 


28 JUNE 2018 | VOL 558 | NATURE | 571 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Received: 2 November 2017; Accepted: 17 April 2018; 
Published online 27 June 2018. 


1. Fleury, R., Sounas, D. L., Sieck, C. F., Haberman, M. R. & Alu, A. Sound isolation 
and giant linear nonreciprocity in a compact acoustic circulator. Science 343, 
516-519 (2014). 

2. Yang, Z. et al. Topological acoustics. Phys. Rev. Lett. 114, 114301 (2015). 

3. Hasselbach, F. & Nicklaus, M. Sagnac experiment with electrons: observation of 
the rotational phase shift of electron waves in vacuum. Phys. Rev. A 48, 
143-151 (1993). 

4.  Dubreuil, N. et al. Eroded monomode optical fiber for whispering-gallery mode 
excitation in fused-silica microspheres. Opt. Lett. 20, 813-815 (1995). 

5. Spillane, S. M., Kippenberg, T. J., Painter, O. J. & Vahala, K. J. Ideality in a 
fiber-taper-coupled microresonator system for application to cavity quantum 
electrodynamics. Phys. Rev. Lett. 91, 043902 (2003). 

6. Chang, L. et al. Parity-time symmetry and variable optical isolation in 
active-passive-coupled microresonators. Nat. Photon. 8, 524-529 (2014). 

7. Gallo, K., Assanto, G., Parameswaran, K. R. & Fejer, M. M. All-optical diode in a 


8. Ibrahim, S. K., Bhandare, S., Sandel, D., Zhang, H. & Noe, R. Non-magnetic 30 dB 
integrated optical isolator in III/V material. Electron. Lett. 40, 1293-1294 (2004). 
9. Yu, Z. & Fan, S. Complete optical isolation created by indirect interband 


0. Kang, M.S., Butsch, A. & Russell, P. S. J. Reconfigurable light-driven opto- 

acoustic isolators in photonic crystal fibre. Nat. Photon. 5, 549-553 (2011). 

1. Lira, H., Yu, Z., Fan, S. & Lipson, M. Electrically driven nonreciprocity induced by 

interband photonic transition on a silicon chip. Phys. Rev. Lett. 109, 033901 

(2012). 

2. Fan, L. etal. An all-silicon passive optical diode. Science 335, 447-450 (2012). 

3. Poulton, C. G. et al. Design for broadband on-chip isolator using stimulated 

Brillouin scattering in dispersion-engineered chalcogenide waveguides. Opt. 

Express 20, 21235-21246 (2012). 

4. Hafezi, M. & Rabl, P. Optomechanically induced non-reciprocity in microring 

resonators. Opt. Express 20, 7672-7684 (2012). 

15. Peng, B. et al. Parity-time-symmetric whispering-gallery microcavities. Nat. 

Phys. 10, 394-398 (2014). 

16. Lu, L., Joannopoulos, J. D. & Soljaéié, M. Topological photonics. Nat. Photon. 8, 

821-829 (2014). 

17. Shen, Z. et al. Experimental realization of optomechanically induced 

non-reciprocity. Nat. Photon. 10, 657-661 (2016). 
8. Kim, J., Kim, S. & Bahl, G. Complete linear optical isolation at the microscale 
with ultralow loss. Sci. Rep. 7, 1647 (2017). 

19. Fang, K. et al. Generalized non-reciprocity in an optomechanical circuit via 
synthetic magnetism and reservoir engineering. Nat. Phys. 13, 465-471 
(2017). 

20. Ruesink, F., Miri, M.-A., Alu, A. & Verhagen, E. Nonreciprocity and magnetic-free 

isolation based on optomechanical interactions. Nat. Commun. 7, 13662 

(2016). 


572 | NATURE | VOL 558 | 28 JUNE 2018 


periodically poled lithium niobate waveguide. Appl. Phys. Lett. 79, 314-316 (2001). 


photonic transitions. Nat. Photon. 3, 91-94 (2009): corrigendum 3, 303 (2009). 


21. Gross, W. A. Gas Film Lubrication (Wiley, New York, 1962). 

22. Shomroni, |., Rosenblum, S., Lovsky, Y. & Bechler, O. All-optical routing of single 

hotons by a one-atom switch controlled by a single photon. Science 345, 

03-906 (2014). 

ranke-Arnold, S., Gibson, G., Boyd, R. W. & Padgett, M. J. Rotary photon drag 

nhanced by a slow-light medium. Science 333, 65-67 (2011). 

24. Matthewson, M., Kurkjian, C. R. & Gulati, S. T. Strength measurement of optical 

ibers by bending. J. Am. Ceram. Soc. 69, 815-821 (1986). 

25. Malykin, G. B. The Sagnac effect: correct and incorrect explanations. Phys. 

Uspekhi 43, 1229-1252 (2000). 

26. Mazzei, A. et al. Controlled coupling of counterpropagating whispering-gallery 
modes by a single Rayleigh scatterer: a classical problem in a quantum optical 
light. Phys. Rev. Lett. 99, 173603 (2007). 

27. Gorodetsky, M. L. & Ilchenko, V. S. Optical microsphere resonators: optimal 
coupling to high-Q whispering-gallery modes. J. Opt. Soc. Am. B 16, 147-154 
(1999). 

28. Li, J., Liu, B., Hua, W. & Ma, Y. Effects of intermolecular forces on deep sub-10 
nm spaced sliders. /EEE Trans. Magn. 38, 2141-2143 (2002). 

29. Arita, Y., Mazilu, M. & Dholakia, K. Laser-induced rotation and cooling of a 
trapped microgyroscope in vacuum. Nat. Commun. 4, 2374 (2013). 

30. Grier, D. G. A revolution in optical manipulation. Nature 424, 810-816 (2003). 


23. 


Oo2mnNwOood 


> 


Acknowledgements We thank U. Hofi, Z. Katz, Y. Halupovich and B. Khachatryan 
for their help. This work was funded by the Israeli Centers for Research 
Excellence (I-CORE), ‘Circle of Light’ Excellence Center, the Israel Science 
Foundation (2013/15), the Israel Ministry of Science, Technology and Space, 
the MURI Center for Dynamic Magneto-Optics via the AFOSR Award number 
FA9550-14-1-0040, the Army Research Office (ARO) under grant number 
73315PH, the AOARD under grant number FA2386-18-1-4045, the CREST 
under grant number JPMJCR1676, the IMPACT programme of JST, the RIKEN- 
AIST Challenge Research Fund, the JSPS-RFBR under grant number 17-52- 
50023, and the Sir John Templeton Foundation. 


Reviewer information Nature thanks A. Alu and M. Levy for their contribution to 
the peer review of this work. 


Author contributions S.M. and R.D. performed the experiments. A.U.H., H.J., 
F.N., E.M., Y.K. and D.N.C. performed the theoretical analysis. T.C. supervised the 
work. 


Competing interests The authors declare no competing interests. 


Additional information 

Extended data is available for this paper at https://doi.org/10.1038/s41586- 
018-0245-5. 

Reprints and permissions information is available at http://www.nature.com/ 
reprints. 

Correspondence and requests for materials should be addressed to T.C. 
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional 
claims in published maps and institutional affiliations. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


METHODS 


Backscattering and bandwidth. Our resonators have a relatively low optical qual- 
ity factor (Q < 10°), which results in a larger operation bandwidth and reduces 
unwanted backscattering. 

Bandwidth. Low-Q resonators have broader resonance full-width at half-maxima 
and therefore improved bandwidth. 

Backscattering. Backscattering typically appears in resonators with Q ~ 108, where 
absorption is reduced but scattering is still large. In such ultrahigh-Q resonators, 
frozen thermal capillary waves on the resonator solid-phase boundaries are cre- 
ated during reflow and scatter light later. Because our quality factor is relatively 
low (Q< 10°) and probably governed by absorption rather than scattering, we 
did not observe any traces of backscattering in the spectrum of our rotating- 
resonator isolator (Fig. 2a, b). On the basis of the rotating-resonator isolation 
ratio (Fig. 2c), we conclude that imperfections in isolator performance, including 
those resulting from potential backscattering, are relevant to less than 0.2% of the 
input power. Although we do not exclude the possibility of backscattering effects 
in rotating-resonator isolators, in particular when the surface is rough, we did not 
observe such backscattering in our experiment. 

Fabrication. We rotate a silica cylinder while its end is heated with a hydrogen- 
oxygen flame to reflow into a spherical shape. We did not attempt to increase the 
optical quality factor because it could have reduced the operation bandwidth of 
our isolator. For this reason, we use industrial-grade (rather than ultra-pure) gases 
for our flame. 

Measuring and calculating isolation. Here we define isolation as the left-propa- 
gating (pass) transmission less the right-propagating (block) transmission. For the 
experimental results (black circles, Fig. 2c), the transmissions subtracted are those 
measured experimentally (Fig. 2a). For the theoretical fit (solid line, Fig. 2c), the 
subtracted transmissions are those revealed from fitting the experimental results 
(Fig. 2a) to equation (4). 

The flying height. We observe experimentally that the taper does not touch the 
rotating resonator even if pushed towards it. Indications that the taper veers away 
from the spinning resonator when pushed towards it include our experimental 
observation that there is no wear of the taper or sphere, even after long periods 
of operation, and the fact that the taper does not stick to the rotating resonator, 
even when pushed towards it. By contrast, in a control experiment in which the 
sphere is stationary, the taper does stick to the sphere, through van der Waals 
forces, and needs to be pulled back to break the connection. Using a microscope, 
we then see micrometre-scale craters on the surfaces of the taper and the sphere, 
where contact was established. No such adhesion or abrasion is observed for the 
rotating-resonator case. 

Fluid-film lubrication*! between surfaces provides a simple way to achieve the 
desired motion of a machine element with minimal friction and no wear. Such 
bearings have been studied for almost a century and found to be robust. In more 
detail, liquid- and gas-film”” lubrication have been studied for applications such as 
sand-proof bearings in marine vessels, dating back to 1932”, and have enabled fly- 
ing heights of less than 10 nm in recently demonstrated helium-filled hard drives. 
Although such fliers are generally stable, modelling of them should consider effects 
such as lubricant compressibility and intermolecular forces. 

Reynolds equation has previously been solved*? with boundary conditions of 
a foil wrapping a rotating cylinder. Using similar methods, but with some mod- 
ification*”, the gap between the surface of the rotating cylinder and the foil was 
calculated to be 


2/3 
h= o.oasn' | H2) (5) 


where t' is the tension of the foil per unit width, R and 2 are the radius and 
angular velocity of the cylinder and ji is the viscosity of the fluid. In our case, the 
curvature of the sphere along the polar direction is so small compared with the 
width of the film-lubricated region of interest that it thins by only 1 A by the end 
of this region. For this reason, we can estimate our rotating sphere using equa- 
tion (5), even though it was originally developed for a rotating cylinder. Because 
equation (5) deals with flat foil, whereas our flyer is a wire, we break the circular 
cross-section of our wire into a set of infinitesimal flat stairs. We integrate the 
tension over these stairs to get the total tension T as a function of the minimum 
taper-sphere separation h (see equation (1)). Under typical operation at a fibre 
tensile stress near 3 GPa”4, equation (1) (with some corrections explained below) 
yields h= 38.2 nm for the sphere with R= 1.1 mm (Fig. 3) and h=320nm for the 
sphere with R= 4.75 mm (Fig. 2). 


LETTER 


We now check the various assumptions of equation (1). 
Intermolecular forces. It has been shown that effects such as Casimir and van der 
Waals forces begin to attract the flyer towards the rotor when the gap between 
them is reduced to less than 10 nm, and to strongly repel them when the gap is 
narrowed further, normally to below 300 fm”*. Taking intermolecular forces into 
account”®, equation (1) becomes 


¥ 3/2 
T= 6.19 R70 [ —__4+__— dx 
9 (ANI x" +r (6) 
2 
49 am B : Tv ii 
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where A and B are the Hamaker constants and hi is the reduced Planck constant. 
As expected, this intermolecular-force correction was found to be negligible in our 
experiment, with a contribution to the tension of less than 1% of the gas-lubricant 
contribution. Nonetheless, we do not exclude the possibility that such intermolec- 
ular forces might be relevant in future experiments, for example, when the rotation 
speed is reduced and the optical flyer is closer to the dielectric sphere. 

Lubricant compressibility. We considered the effects of lubricant compressibil- 
ity, using previously described methods", and found them to be relevant. In 
more detail, air was calculated to be up to 6.6 times denser at the lubrication 
region, requiring a correction of the constant in equation (1)! (see Extended Data 
Table 1). 

Tapered-fibre stiffness. We calculated the effects of the tapered-fibre stiffness, 
using a previously reported method*, and found that they satisfy a condition that 
demonstrates the validity of equation (1). 

Wrap angle. We calculated the effects of wrap angle, using a previously reported 
method’, and found that they introduce a correction of less than 1% in the con- 
stant coefficient in equation (1). 

Fluid inertia. We calculated the effects of fluid inertia were calculated, using a previ- 
ously reported method”, and found that they satisfy a condition that demonstrates 
the validity of equation (1). 

Mean free pass. At a certain small scale, we can no longer consider air as a 
continuum and so have to consider the mean free path?” between individual 
air molecules. This issue in the gas-film lubricant is treated here by using the 
Knudsen number, defined by the ratio between the flying height h and the mean 
free pass*!. 

For the case shown in Fig. 2, the Knudsen number is 0.13—corresponding to 
a regime in which continuous flow can be assumed and equation (1) needs no 
corrections. 

For the case shown in Fig. 3, the Knudsen number is 1.4—corresponding 
to a regime in which free molecular flow effects should be taken into account. 
Calculating such flows requires a molecular-based model such as the Fukui- 
Kaneko model**. Several other models for molecular gas-film lubrication have 
been reviewed® to provide an order-of-magnitude variation in results at this 
Knudsen number. 

Numerically calculating the field flow. We calculate the airflow near the taper 
numerically using Ansys—Fluent software (https://www.ansys.com), assuming 
compressible flow. 

Data availability. The data that support the findings of this study are available 
from the corresponding author on reasonable request. 
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Extended Data Table 1 | Effects of lubricant compressibility 


Case Lubricant Compressibility Flying height 
compressibility reduces the constant [nm] 
in equation (1) by 
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A standing molecule as a single-electron field 


emitter 


Taner Esat!?, Niklas Friedrich”, F. Stefan Tautz!2* & Ruslan Temirov!? 


Scanning probe microscopy makes it possible to image and 
spectroscopically characterize nanoscale objects, and to 
manipulate’? and excite** them; even time-resolved experiments 
are now routinely achieved™!°. This combination of capabilities has 
enabled proof-of-principle demonstrations of nanoscale devices, 
including logic operations based on molecular cascades", a single- 
atom transistor’”, a single-atom magnetic memory cell’? and a 
kilobyte atomic memory". However, a key challenge is fabricating 
device structures that can overcome their attraction to the underlying 
surface and thus protrude from the two-dimensional flatlands of the 
surface. Here we demonstrate the fabrication of such a structure: 
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Fig. 1 | Creation of a standing molecule. a, Schematic side view of a 
standing PTCDA molecule. b, Constant-current STM images show the 
stepwise assembly of a PTCDA + 2Ag complex by lateral manipulation 
of silver atoms with the tip of the SPM. The structure of PTCDA has 
been overlaid. c, AFM image of the PTCDA + 2Ag complex, measured 
with a tip that has been functionalized with a carbon monoxide molecule 
(see Methods). d, The three-dimensional trajectory that was used to lift 
up the PTCDA + 2Ag complex. The trajectory proceeds from the contact 
point (labelled ‘bond formatior and located above the white cross in 
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we use the tip of a scanning probe microscope to lift a large planar 
aromatic molecule (3,4,9,10-perylenetetracarboxylic-dianhydride) 
into an upright, standing geometry ona pedestal of two metal (silver) 
adatoms. This atypical and surprisingly stable upright orientation of 
the single molecule, which under all known circumstances adsorbs 
flat on metals!>!°, enables the system to function as a coherent single- 
electron field emitter. We anticipate that other metastable adsorbate 
configurations might also be accessible, thereby opening up the third 
dimension for the design of functional nanostructures on surfaces. 
The fabrication of a standing molecule on an Ag(111) surface 
(Fig. 1a) proceeds in three steps. First, two silver adatoms are attached 
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cand d) in two segments (indicated by red arrows), through the bond- 
breaking point. The colour of the trajectory refers to the measured AFM 
signal Af. e, Side view of the standing molecule, including the silver atoms 
at the surface contact. The bar shows the AFM signal Af at tip height of 
z=17.5A above the surface (region in f indicated by the red dotted box). 
f, AFM image of the standing molecule, recorded at z= 17.5 A. 

g, Approach curves of the SPM tip above the standing molecule (red) and 
above the bare Ag(111) surface (blue). The ordinate shows the frequency 
shift Af of the AFM, which is proportional to the vertical force gradient. 
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Fig. 2 | Stability and geometry of the standing molecule. a, AFM images 
above a standing molecule. For clarity, only selected images of the dense 
stack, measured in steps of Az=0.15 A, are shown. b, Lateral forces F, 
(top) and F, (bottom) exerted by the tip on the standing molecule at 
z=17.5A. ¢, Lateral manipulation trajectory (black line). Only hollow 
sites of the Ag(111) surface are visited. d, Orientation of the standing 


to one of the anhydride groups of the 3,4,9,10-perylenetetracarboxylic- 
dianhydride (PTCDA) molecule that we use in this study!7!8, This 
proceeds by in-plane manipulation with the tip of a scanning probe 
microscope (SPM)’, as shown in the series of scanning tunnelling 
microscopy (STM) images in Fig. 1b. The high-resolution atomic 
force microscopy (AFM) image in Fig. 1c reveals an asymmetry of 
the PTCDA + 2Ag complex, which demonstrates that the chemical 
structure of PTCDA has been modified. It also shows that the complex 
sits on the surface in an essentially planar configuration, with a tilt of 
4.0° + 0.6°, as determined using the method reported in ref. ae 

In the second step, the tip is moved towards the carboxylic oxygen on 
the pristine side of the molecule (cross in Fig. 1c). During approach, the 
current I through the junction is monitored. The approach is stopped 
immediately after the current increases abruptly, because this indicates 
the formation of a covalent bond between the tip and the molecule”. 

In the third step, the tip with the molecule attached is retracted along 
a customized three-dimensional trajectory (Fig. 1d)”. Initially, this tra- 
jectory mimics the peeling of an adhesive from a desk, carried out with 
the constraint that the far end of the molecule (where the silver atoms 
are attached) remains stationary on the surface. Later, the tip is lifted 
vertically, straining the tip—molecule-surface junction of the now- 
upright molecule until the bond between the molecule and the tip 
breaks. It is clear that this bond is weaker than the bond to the surface, 
because the latter consists of two Ag-O bonds, whereas the former 
has only one. 

The AFM image in Fig. 1f, scanned 17.5 A above the surface, reveals a 
prolate object with linear dimensions of 7.8 A x 5.9 A x 17.5A (Fig. 1f, g), 
close to what is expected for an upright molecule. This demonstrates 
that after the bond to the tip has broken the molecule remains stand- 
ing on the surface (Fig. le). Similar molecules to PTCDA can also be 
erected (Extended Data Fig. 1), and two standing molecules can be 
placed at desired positions relative to each other (Extended Data Fig. 2). 

In our experience, the standing molecule never falls back to the 
surface by itself. To learn more about this remarkable stability of the 
standing molecule, we study its interaction with the tip. We acquire 
AFM images at different heights above the molecule (Fig. 2a) and inte- 
grate the frequency-shift signal Af(x, y, z), which is proportional to the 
force gradient OF (x, y, z)/Oz, twice along z at each lateral position (x, y) 
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molecule, as derived from Extended Data Fig. 5. Black and red symbols 
indicate the possible positions of one of the silver atoms at the surface 
contact, when the other sits in the centre. During both translations 

and rotations, the standing molecule never jumps over a surface atom. 
Images in c and d were recorded at z=17.5 A. The lateral and rotational 
manipulation procedures are shown in Extended Data Fig. 3. 


(ref. 2). We thus obtain the full interaction potential U(x, y, z) between 
the tip and the standing molecule. Figure 2b displays (x, y) maps of the 
lateral forces F,,= —OU/Ox and F,= —OU/0y that act on the molecule 
for a tip height just above the upper end of the molecule. These forces 
are attractive, reaching maximum values of F, 80 pN and F,~110pN. 
For larger forces, which act at even closer distances, the molecule jumps 
laterally on the surface towards the tip, keeping its upright orientation 
(Extended Data Fig. 3a). We therefore did not succeed in toppling the 
molecule over mechanically. Note that the threshold force for the lateral 
displacement of the standing molecule is similar to the forces required 
to move single atoms”. 

The force constant for tilting the molecule out of the upright orien- 
tation can be estimated from Fig. 2b (see Methods and Extended Data 
Fig. 4 for details). The standing molecule can also be rotated around its 
long axis by injecting a current (Extended Data Fig. 3b). By combining 
translational and rotational manipulation, we can determine the struc- 
ture at the interface between the standing molecule and the Ag(111) 
surface. We find that the standing molecule adsorbs with its silver 
atoms in hollow sites of the Ag(111) surface (Fig. 2c). The epitaxial reg- 
istry between the essentially rigid Ag—O arrangement in the standing 
molecule and the hollow sites in the Ag(111) surface determines the 
possible orientations of the standing molecule (Fig. 2d, Extended Data 
Fig. 5). 

We now discuss the charging and field-emission properties of the 
standing molecule. The corresponding processes are shown schemati- 
cally in Fig. 3a. In the standing configuration, there is negligible overlap 
between the metal surface and the lowest unoccupied molecular orbital 
(LUMO) of the standing molecule—a 7 orbital that has its lobes ori- 
ented perpendicular to the plane of the molecule. Hence, the electronic 
coupling between this orbital and the surface is weak, and the charge 
state of the molecule, functioning as a quantum dot, can be controlled 
with single-electron precision. When a negative bias voltage V is applied 
to the surface, the molecular levels are gated with respect to the Fermi 
energy Ey of the substrate. At a critical bias voltage, the unoccupied 
molecular orbital shown in Fig. 3a aligns with Ep and one electron 
tunnels from the surface into this orbital. This charging event appears 
in the Af signal of the AFM as a sharp dip*”? (red curve in Fig. 3a) 
and in the field-emission current as a steep rise (blue curve). Once on 
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Fig. 3 | A standing molecule as a single-electron field emitter. a, Energy- 
level diagrams before charging (top right) and in the field-emission regime 
(top left), along with the frequency shift Af(V) (red) and field-emission 
current I(V) (blue; the dotted black line is a Fowler—-Nordheim fit) at 
z=56.0A (bottom). Es Fermi energy; q, elementary charge; V, bias 
voltage; ®, work function; |r—rj|, distances between the tip and the field- 
emission points. b, Field-emission image (top left) and line profile (bottom 
left; blue) above a standing molecule at z=73.5 A, with V= —24.7 V. The 
black line shows a profile after filtering out the background, and multiplied 
by a factor of 25. The signal-to-background ratio is about 2%. A scale 
drawing of the field-emission experiment is shown on the right. 

c, Successive field-emission images (without the background), taken at 

the same conditions as for b. The arrow indicates the time line of the 


the standing molecule, the additional electron may tunnel through the 
trapezoidal barrier to the tip (Fig. 3a), giving rise to a field-emission 
current that follows the well-known Fowler-Nordheim behaviour 
I(V) x (V*/®)exp(—(|r — r|®*"/| V|), where @ is the work function and 
Gis a constant”. For the fit in Fig. 3a, we have replaced |r — r;| by the 
vertical distance d between the top of the standing molecule and the tip. 
An image of the field-emission current in the detection plane above 
the standing molecule is shown in Fig. 3b. Outside an elliptical area 
there is no field emission. This result is a consequence of the fact that 
for lateral tip positions too far away from the standing molecule the 
gating is too weak to provoke electron transfer into the unoccupied level 
in Fig. 3a, which also precludes field emission. (It also shows that in 
our experiment field emission occurs exclusively through the stand- 
ing molecule.) The current appears to increase exponentially towards 
the centre of the elliptical field-emission area (Fig. 3b), in line with 
the Fowler-Nordheim equation. However, closer inspection reveals 
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experiment. The two dotted white lines indicate the rotation of the 
interference pattern, and hence of the molecule, by 30°, from a 30° 
orientation to a 60° orientation, as defined in Fig. 2d. d, Bottom, side and 
top view of the LUMO of PTCDA®”. The colour indicates the phase of the 
wavefunction: j= 0 (red) or j= 5 (blue). Top, interference of spherical 
waves from different points r; within an extended orbital, at the position r 
of the tip. The lobes of the LUMO of PTCDA are approximated by grid 
points located within cuboids. e, Interference pattern |>7, U(r). 

f, Gundlach oscillations ore |: . g, Calculated field-emission pattern 
| yj TV, r)( r)|°. Imagés‘in Ay if were calculated using grid and phases 4; 
as in d (see insets), and with z and Vas in b andc. h, As for g, but with 
(pj = 0 everywhere. All field-emission images are shown in the orientation 
of Fig. 2c, d. 


additional structure. After filtering out the background that originates 
from field-emitted electrons that hit the tip away from its apex, we find a 
pattern of concentric rings that are modulated by a four-fold-symmetric 
cloverleaf (Fig. 3c). Figure 3c also shows that the pattern is locked to the 
molecule: after about one-third of the second of two consecutive images, 
the molecule and with it the field-emission pattern rotated by about 30°. 
Modelling the pattern in Fig. 3c demonstrates that the standing mol- 
ecule is a coherent field emitter. From each point r; within the emitting 
orbital, a spherical electron wave 
2n |r—r il 
era _ 


emanates (Fig. 3d). Here \=h/[2m(qV — ®)] 4 isthe de Broglie wave- 
length of the electron, with mass m and elementary charge q, qV — Pis 
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the kinetic energy of the electron just before it reaches the tip (Fig. 3a), 
his Planck’s constant and y;is the phase of the emitting orbital at point 
r;. Because all of these waves originate from the same orbital, they are 
coherent and interfere at the tip position r, giving rise to an interference 
pattern as the tip is scanned in the detection plane. Simulations of the 
interference with phases y; of the LUMO of PTCDA (Fig. 3d) also 
produce a cloverleaf pattern (Fig. 3e), as in experiment. The additional 
concentric rings in Fig. 3c originate from field-emission resonances, as 
illustrated in Fig. 3f. Having passed the trapezoidal barrier, electrons 
may become transiently trapped between the metal tip and the bar- 
rier, because reflections at the tip and the barrier lead to the formation 
of resonance states. Because barrier tunnelling preserves coherence, 
these resonances lead to an oscillatory barrier transmission (Gundlach 
oscillations)**: 


exp(—B|r—r}|) 
T(V,1) 0 +" (2) 
C+ sin(D|r—r;|) 


where B, C and D are constants that depend on the applied voltage V 
and the work function & (see Methods). 

Combining the effects of coherent electron emission from an 
extended orbital and Gundlach oscillations at a trapezoidal barrier, we 
modulate the amplitudes of the spherical waves V(r) with the barrier 
transmission T;(V, r), sum over all points r; within the orbital from 
which electrons are emitted and calculate the square modulus 
| yy, TV, ryw(r)/. This yields the field-emission pattern shown in 
Fig. 3g, which is in excellent agreement with our experiment (Fig. 3c). 
This agreement confirms the successful fabrication of standing mole- 
cules and shows that the erected molecular structure keeps its vertical 
orientation even at extreme current densities of 10° A m~. 

Cascading the charging of the quantum dot and field emission has 
several benefits. First, it enforces pre-emission coherence of the ini- 
tial state across many atoms, thus enabling the realization of coherent 
emitters that consist of many more than one atom”. Owing to chemical 
perfection, in principle it is also possible to make emitters of arbitrary 
shapes, including fully coherent double-slit emitters”°. Second, cascad- 
ing permits the emission of single electrons on-demand”, if combined 
with time-resolved gating”*®. Arbitrarily shaped on-demand emitters, 
arranged regularly on a surface, could be particularly promising for 
experiments with coherent electrons, including low-energy electron 
holography”. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0223-y. 
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METHODS 


Sample preparation. An atomically clean Ag(111) surface was prepared in ultra- 
high vacuum by repeated cycles of Ar* sputtering and annealing at 800 K. A small 
coverage of PTCDA molecules (less than 20% of a monolayer) was deposited onto 
the clean Ag(111) surface, held at room temperature, from a custom-built Knudsen 
cell. After deposition, the sample was moved into the microscope and cooled to 5K. 
Heating a thin silver wire, we evaporated silver atoms onto the Ag(111) surface in 
situ at T~ 10K, thus preventing clustering. 
Tip preparation. We used a PtIr wire as the SPM tip, sharpened with a focused-ion 
beam (FIB). The Ptlr tip was treated in situ by applying controlled voltage pulses and 
indentations into the clean Ag surface, resulting in a clean Ag-covered tip. The clean- 
ness of the tip was validated by the spectroscopic signature of the Ag(111) surface state 
and STM imaging of the LUMO of PTCDA. For the high-resolution AFM imaging 
shown in Fig. 1c, the tip was functionalized with a carbon monoxide molecule*!. 
NC-AFM/STM experiments. The experiments were carried out in a non-con- 
tact atomic force/scanning tunnelling microscope (NC-AFM/STM; CREATEC) 
under ultrahigh-vacuum conditions at a temperature of T~5 K. The microscope is 
equipped with a qPlus sensor (CREATEC) and operated in frequency-modulation 
mode (resonance frequency fo © 31 kHz, quality factor Q~ 4 x 10, spring constant 
k=1,800N m“!, oscillation amplitude A ~0.2-0.3 A). All bias voltages quoted here 
were applied to the sample while the tip was grounded. STM images were acquired 
in constant-current mode. The AFM images were recorded in constant-height 
mode at V=2 mV and show the frequency shift Af(z) © —[fo/(2k)]dF,/dz. Field- 
emission images were recorded in constant-height mode with qPlus oscillations 
turned off, and show the current I. 
Structure of the PTCDA + 2Ag complex. Density functional theory (DFT) cal- 
culations were used to determine the structure of the PTCDA + 2Ag complex in 
the gas phase. They were carried out with GAMESS* using a 3-21 G basis set 
including polarization functions for heavy atoms and the hybrid Becke three-pa- 
rameter Lee-Yang-Parr (B3LYP) functional. The structure of the standing mol- 
ecules shown in Fig. 1 and Extended Data Fig. 1 is the gas-phase structure of the 
PTCDA + 2Ag complex. 
Tilting stiffness of the standing molecule. When moving the tip towards the stand- 
ing molecule along the y direction (Fig. 2a, b), the molecule cannot succumb to the 
attractive force by tilting towards the tip, because this would require breaking one 
of the Ag—O bonds at the surface. The F, force map shown in the bottom panel of 
Fig. 2b therefore reveals the intrinsic range of the attractive force between the tip and 
the molecule. By contrast, when the tip is moved along the x direction (top panel of 
Fig. 2b), the molecule is able to tilt towards the tip. This leads to the more prolate fea- 
tures in the F, map. Comparing the F, and F, maps in Fig. 2b, we can estimate the 
force constant for the initial tilting of the molecule out of the upright orientation. 
In detail, this proceeds as follows. We model the force between the tip and a 
corner oxygen atom, which is the most reactive part of the molecule, as 


F(r) =Ae"/’e, (3) 


where r is the distance between a corner oxygen atom and the tip apex, e, the unit 
vector in this direction and p an appropriate decay constant. For the approach along 
the y direction, we know that the molecule cannot bend towards the tip. When 
calculating the force acting from the tip onto the molecule, we take into account 
that there are two corner oxygen atoms, separated by 2a along the y axis, for each 
of which the force law in equation (3) holds. We therefore obtain for the lateral 
component of the force onto the molecule 


E(y)= Ale 4/759) cos ax, +e 4/522) cos a, } (4) 


where d is the vertical distance between the top of the molecule and the imaging 
plane, a; =arctan[d/(y+a)] and a =arctan[d/(y — a)] (Extended Data Fig. 4a). 
Equation (4) can be used to fit the experimental F,(y) profile in Extended Data 
Fig. 4c (black data points), extracted from the bottom panel of Fig. 2b, using the 
amplitude A and the decay length p as fit parameters. As the blue curve in Extended 
Data Fig. 4c shows, we obtain an excellent fit for p= 119 pm. 

In the next step, we use the force law of equation (3) to extract the bending 
stiffness of the standing molecule when the tip approaches along x. The F,(x) 
profile, extracted from the top panel of Fig. 2b, is shown in Extended Data Fig. 4c 
as red data points. This force corresponds to an equilibrium between the lateral 
attraction F,(x), which acts between the tip and the molecule, tilting the molecule 
towards the tip, and the lateral component Fy,,(x) of the restoring force that tries 
to bring the molecule back into the vertical (see Extended Data Fig. 4b). For each 
x, we therefore have to find the tilting angle 0.q(x) for which the force equilibrium 


[B(%, Oeq(X))1 = [Fa x25 Aoq()) | (5) 


holds. For the restoring force Fy, we assume Hook’s law for the corresponding 
torque, [Fy K90, where | is the length of the standing molecule and kg is the 
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torsional spring constant. Its x component therefore becomes Fy, (K96/I)cos0. 
The total force on the molecule, Fyotai =|F, + F)| in Extended Data Fig. 4b, is then 


Fetal = 2\F(r) |sin y (6) 


and by symmetry located in the x-z plane. From Extended Data Fig. 4b, we obtain 
the following geometric relations: 


a 
cosy = — 
r 


raf (Xm)? +a? + (7) 


Xm =l sind 
b=s+d=(x—x,,)tana 


where 7, a, s, b and Xp are defined in Extended Data Fig. 4b. Projecting Fiotai 
(equation (6)) onto the x direction, we obtain 


FE. = 2|F(r) |sin y cosa (8) 


With the help of equation (7), F, can be expressed as a function of (x, 0). For each 
tip position x, the tilting angle 0 is then adjusted to fulfil the equilibrium condition 
(equation (5)). The results are shown in Extended Data Fig. 4c-e, in which the left 
panels display the simulated lateral force F, (green curve), in comparison with the 
experimental F, (red data points) and F, (black data points); the corresponding 
tilt angles @.q(x) and linear elongations x(x) are plotted in the middle and right 
panels. Extended Data Fig. 4c shows the best fit, yielding an initial tilting stiffness 
of kg=630ZN m rad“!, corresponding to k = K9/l’=0.38N m_ '. For comparison, 
the stiffness of a carbon monoxide molecule at the tip ofa SPM is k=0.24N m7}, 
whereas on a Cu(111) surface it is «= 1.64N m! (ref. °°). Extended Data Fig. 4d 
shows that for large tilting stiffnesses the simulated F,, for the approach along x 
converges to the measured F, for approach along y, as it must. Finally, Extended 
Data Fig. 4e illustrates the effect of a soft restoring-force constant on F, and the 
molecular tilt 0.4 or elongation xm. We note that, owing to anharmonicity in Fy and 
other secondary effects, our model is applicable for small x only. 

Structure at the interface between the standing molecule and the Ag(111) 
surface. By combining translational and rotational manipulation of the standing 
molecule, we can determine the structure at the interface between the standing 
molecule and the Ag(111) surface. 

A single translational manipulation step is illustrated in Extended Data Fig. 3a. 
By repeating this procedure several times, the standing molecule is moved along 
a specific trajectory on the surface. Part of this trajectory is plotted in the AFM 
image in Fig. 2c. Analysing the trajectory, we find that possible adsorption sites 
of the standing molecule form the same lattice as the hollow sites on the Ag(111) 
surface (Extended Data Fig. 6). This suggests that the silver atoms of the standing 
molecule sit in hollow sites. Rotational manipulation data verify this conjecture. 

A single rotational manipulation step is illustrated in Extended Data Fig. 3b. The 
standing molecule tends to rotate towards the tip position. Analysing a large number 
of erected and/or rotated molecules (Extended Data Fig. 5), we find that standing 
molecules exist only in six distinct lateral orientations on the Ag(111) surface. These 
are indicated by the three black lines and three red lines in Fig. 2d. We know that the 
distance between the silver atoms in the standing molecule is approximately 2.9 A, 
because in the AFM images of the standing molecule this is the distance between the 
maxima of the attractive force, which acts between the oxygen atoms at the upper 
end of the standing molecule and the silver atom at the apex of the tip (Fig. le). 
Because this Ag—Ag distance fits well to distances between the hollow sites of the 
Ag(111) surface, which amount to 2.9A (red squares in Fig. 2d) and 3.4A (black 
circles), we are led to the conclusion that the silver atoms of the standing molecule 
do indeed sit in hollow sites. Evidently, the epitaxial registry between the essen- 
tially rigid Ag—O arrangement in the standing molecule and the hollow sites of the 
Ag(111) surface determines the possible orientations of the standing molecule. Note 
that in AFM images molecules in the orientation shown in black in Fig. 2d appear 
asymmetric (for example, in Extended Data Fig. 3b after manipulation), because the 
mismatch between the silver atoms in the standing molecule and the hollow sites in 
this surface direction enforces a tilt of the standing molecule. 

Altogether, we have built and erected at least 107 different PTCDA molecules. 
We measured the orientation of only some of them. To obtain a larger dataset 
regarding the orientation of standing molecules, we rotated several standing mol- 
ecules, as demonstrated in Extended Data Fig. 3b. The total number of measured 
orientations is therefore 128 (Extended Data Fig. 5). However, to avoid any direc- 
tionality during the collection of rotation statistics, we placed the tip above the 
centre of the standing molecule (not next to it, as in Extended Data Fig. 3b). By 
this procedure the standing molecule rotates randomly. 

The two Ag-Ag distances of 2.9 A and 3.4A also correspond to different types 
of hollow site of the Ag(111) surface. A spacing of 2.9 A coincides with the distance 
between two adjacent hexagonal close packed (hcp) or two adjacent face-centred 
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cubic (fcc) sites (red squares in Fig. 2d), whereas for a spacing of 3.44 one of the 
hollows is an hep site and the other is an fcc site. We find that the fec/hcp config- 
uration (black circles) is less abundant than the fcc/fec or hcp/hcp configuration 
(red squares): 52 versus 76 out of a total of 128 (see Extended Data Fig. 5). This is 
explained by the better fit of the fcc/fcc and hcp/hcp configurations to the natural 
distance of two silver atoms bound to the dianhydride group of PTCDA (2.9 A) 
and the higher degeneracy of these configurations. 

A distinction between fcc/fec and hcp/hcp configurations is not possible in 
either AFM or the field-emission experiment. The field-emission images in Fig. 3c 
correspond to different adsorption configurations, because a rotational step of 
30° from the 30° to the 60° orientation (Fig. 2d), as seen in the field-emission 
images, must correspond to a change from an fcc/hcp to an hcp/hep or fec/fec 
configuration (Fig. 2d). However, the interference patterns look identical within 
experimental accuracy, showing that the effect of different adsorption configura- 
tions on the emitted electrons is negligible. 

Stabilization mechanism of the upright configuration. Manipulation experi- 
ments on the standing molecule reveal elements of the stabilization mechanism 
of the upright configuration. We observe that a positive bias voltage applied to 
the surface topples the molecule over (Extended Data Fig. 3c), suggesting that 
the standing molecule is negatively charged. Similarly, if two standing molecules 
are brought very close to each other, they both topple over. This observation is 
explained either by the electrostatic force between the standing molecules or by 
the mutual destabilization of the charge on each of them. The fact that the standing 
molecule acts as a quantum dot with integer charge states (Fig. 3a) suggests that 
the stability of its upright configuration may result from electron-electron corre- 
lations. This would explain why DFT calculations do not find a metastable upright 
configuration for the PTCDA + 2Ag complex on Ag(111) and sets our case apart 
from others, where upright standing configurations are stabilized by intermolecular 
interactions!>*4~** or directional bonds to the surface*”~*°. 

Detection of the field-emission current. We detect the field-emission current with 
the tip of our STM/AEM. Unlike in tunnelling experiments, the current is detected 
over the complete surface of the tip. Specifically, the field-emission current does 
not distinguish between electrons that hit the tip apex (well-defined path length) 
and all other positions on the tip surface (no well-defined path length). The sum 
of the latter contributes to the large background in the field-emission current. The 
background that arises from our mode of detecting the field emission is thus not 
necessarily incoherent, although other mechanisms that create decoherence may 
be present. The signal-to-background ratio is tip-dependent, and approximately 
2% in the data in Fig. 3b, c and Extended Data Fig. 7. 

Modelling the interference pattern. In accordance with ref. “4, we used the follow- 
ing expressions in equation (2) for the barrier transmission T;(V, r) for electrons 
at the Fermi level: 


D= 4.J/2m (qV— 63/2 
3 hh qv 


The work function @ of Ag(111) was taken to be equal to 4.4 eV". When sim- 
ulating the field-emission images in Fig. 3e-h, grid points r; within an isosurface 
that contains 90% of the charge density of the orbital were used. 

In equation (1), we have approximated the de Broglie wavelength \ of the 
field-emitted electron by its value just before it reaches the tip. In reality, its 
energy and therefore also change on its passage towards the tip. This issue has 
been considered previously”®, and it was shown that the effect of the electron 
acceleration should not induce substantial changes in the interference patterns 
observed. We validated this directly by collecting field-emission images at 
different voltages (Extended Data Fig. 7). Gundlach transmission in equation 
(2) takes the acceleration of the electrons into account, because it is calcu- 
lated explicitly for a trapezoidal barrier. Hence, the fine structure of the field- 
emission pattern that originates from Gundlach resonances is not affected by 
the above approximation. 

The barrier transmission in equation (2) has an exponential dependence 
on the distance between the emission and detection points. Thus, the contri- 
butions from the lower parts of the emitting orbital to the interference pattern 
(Fig. 3e-h) are negligible. Nevertheless, the amplitude and phase distributions 
that we observe for the upper part of the molecule can exist only if the state 
of the electron before emission is coherent over all 38 atoms of the molecule, 
because the molecular orbital from which the electrons are emitted is extended 
over the complete molecule, regardless whether emission from the lower part 
is negligible. 

Data availability. The datasets generated and analysed during this study are avail- 
able from the corresponding author on reasonable request. 
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Standing NTCDA + 2 Ag [a2 A 
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Extended Data Fig. 1 | A standing NTCDA molecule. a, Constant- c, Schematic side view of standing PTCDA and NTCDA molecules. The 
current STM image of an NTCDA (1,4,5,8-naphthalenetetracarboxylic- length difference of 4.2 A between the two molecules corresponds well 
dianhydride) molecule and two silver adatoms, recorded before the with the tip-height difference Az=17.5 A — 13.5A=4.0A between 
assembly of a NTCDA + 2Ag complex. b, AFM image of a standing the AFM images of the standing PTCDA (see Fig. 1f) and NTCDA (b) 


NTCDA molecule, recorded at a tip height of z= 13.5 A above the surface. molecules. 
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Extended Data Fig. 2 | Two standing PTCDA molecules. a, Constant- molecules, recorded at tip height of z= 17.5 A above the surface (left). One 
current STM image of two PTCDA + 2Ag complexes that were assembled of the standing molecules is then moved closer to the other by the lateral 
in the same way as shown in Fig. 1b. b, AFM image of the standing manipulation procedure demonstrated in Extended Data Fig. 3a (right). 
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Extended Data Fig. 3 | Manipulation of the standing molecule. a, Lateral 
movement of the standing molecule by tip approach. The white cross in 
the AFM image on the left indicates the tip position during manipulation 
(V=2mV). b, Rotational movement of the standing molecule by a current 
pulse. The white cross in the AFM image on the left indicates the tip 
position during manipulation (z= 17.5 A). The molecule jumps from a red 


t(s) 
(symmetric) to a black (asymmetric) position (see Methods). c, Toppling 
over the standing molecule to the surface by using a positive bias-voltage 
sweep. The white cross in the AFM image on the left indicates the tip 
position during manipulation (z= 17.5 A). The constant-current STM 
image on the right shows the PTCDA + 2Ag complex after the toppling. 
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Extended Data Fig. 4 | Determining the tilting stiffness of the standing 
molecule. a, Approach along the y direction as defined in Fig. 2a, b. 
Parameters are d= 3.25 A, /= 12.9 A and 2a=4.55 A; d+1+z, because 
the tip height z is measured from the centre of the uppermost surface 
layer, whereas / is measured from the centre of the two silver adatoms 

in the PTCDA + 2Ag complex. b, Approach along the x direction as 
defined in Fig. 2a, b. The sketch on the left shows a side view, whereas 

on the right a perspective view onto the top of the molecule is drawn. 
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c, Left, best fit (green line) of the experimental F, (on x axis, red circles), 
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obtained with the model in equation (8) and kg =630zN m rad“! 

(« =0.38 N m_’). Black data points display F, (on y axis), fitted with 
equation (4) (blue line). d, As in ¢, but for a simulated curve (green) with 
kg=20.0aN mrad! (k= 12.02N m~!), which is too stiff to reproduce 
the experimental F,, (red). e, As in c, but for a simulated curve (green) 
with kg=310zN m rad~! (« =0.19N m7), which is too soft. For a 
detailed discussion of this figure, see Methods. Tilt angles 0.q and linear 
elongations x are plotted in the middle and right panels in c-e. 
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Orientation of the standing molecule (°) 
Extended Data Fig. 5 | Orientation of the standing molecule. The 
histogram (left) shows all evaluated orientations of standing PTCDA 
molecules on the Ag(111) surface. The angle is defined as in Fig. 2d and 
Extended Data Fig. 3b. In total, we evaluated the orientations of 128 


standing PTCDA molecules. The possible orientations on the Ag(111) 
surface are illustrated on the right. Black and red symbols indicate the 
possible positions of one of the silver atoms at the surface contact, when 
the other sits in the centre. See also Fig. 2d and Methods. 
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Extended Data Fig. 6 | The Ag(111) lattice. Constant-current STM image 
of the atomically resolved Ag(111) surface. In all experiments, the Ag(111) 
lattice orientation was as shown in this image. 
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Extended Data Fig. 7 | Field-emission images. a—c, Successive field-emission images (without the background) recorded at z= 73.5 A and bias voltages 
of V=—24.00 V (a), V= —24.35 V (b) and V= —24.70 V (c). 
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Self-assembly of highly symmetrical, ultrasmall 
inorganic cages directed by surfactant micelles 


Kai Ma!, Yunye Gong?, Tangi Aubert!*, Melik Z. Turker!, Teresa Kao!, Peter C. Doerschuk** & Ulrich Wiesner! 


Nanometre-sized objects with highly symmetrical, cage-like 
polyhedral shapes, often with icosahedral symmetry, have recently 
been assembled from DNA’*, RNA‘ or proteins” for applications 
in biology and medicine. These achievements relied on advances 
in the development of programmable self-assembling biological 
materials’~!°, and on rapidly developing techniques for generating 
three-dimensional (3D) reconstructions from cryo-electron 
microscopy images of single particles, which provide high-resolution 
structural characterization of biological complexes!!-!°. Such single- 
particle 3D reconstruction approaches have not yet been successfully 
applied to the identification of synthetic inorganic nanomaterials 
with highly symmetrical cage-like shapes. Here, however, using a 
combination of cryo-electron microscopy and single-particle 3D 
reconstruction, we suggest the existence of isolated ultrasmall (less 
than 10 nm) silica cages (‘silicages’) with dodecahedral structure. 
We propose that such highly symmetrical, self-assembled cages 
form through the arrangement of primary silica clusters in aqueous 
solutions on the surface of oppositely charged surfactant micelles. 
This discovery paves the way for nanoscale cages made from silica 
and other inorganic materials to be used as building blocks for a 
wide range of advanced functional-materials applications. 

In our search for dodecahedral silica-cage structures (Fig. 1), we 
began with the early stages in the self-assembly of silica structures 
directed by surfactant micelles'*. Our synthesis system contained 
cetyltrimethylammonium bromide (CTAB) surfactant micelles 
and tetramethyl orthosilicate (TMOS) as a sol-gel silica precursor 
(see Methods). We added hydrophobic mesitylene (TMB) into the 
aqueous CTAB micelle solution, increasing the size and deformability 
of the micelles!®. We selected TMOS as the silica source because of 
its fast hydrolysis rate in water, and we adjusted the initial reaction 
pH to about 8.5. When TMOS was added, its hydrolysis to silicic 
acid reduced the reaction pH to neutral!®!’. The lowered pH 
accelerated silane condensation, leading to primary silica clusters!* with 
diameters of around 2 nm. The negatively charged silica clusters were 
attracted to the positively charged CTAB micelle surface, assembling 
into micelle-templated nanostructures". This experimental design—in 
which fast hydrolysis and condensation of the silica precursor quickly 
terminated the reaction—allowed the early stages in the micelle- 
directed self-assembly of silica structures to be preserved'”. 

In order to improve the dispersity of particles on transmission 
electron microscopy (TEM) grids, we added low-molar-mass, silane- 
modified monofunctional polyethylene glycol (PEG) into the solu- 
tion one day before preparing TEM samples. This process covalently 
coated the accessible silica surface’, yielding PEGylated nanoparticles 
(Extended Data Fig. 1) that could be further purified and isolated from 
the synthesis solution. Using TEM, we observed particles of narrow size 
distribution, with an average diameter of around 12 nm (Fig. 2a, includ- 
ing inset), consistent with silica structures wrapped around TMB- 
swollen CTAB micelles'®. The detailed particle structure was difficult 
to identify, however. Therefore, we plasma-etched TEM samples on 
carbon grids for five seconds before imaging in order to remove 


excess organic chemicals (such as PEG-silane) that would otherwise 
contribute to background noise. To further improve the signal-to-noise 
ratio, we acquired and averaged a series of images of the same sample 
area. Stripes and windows in zoomed-in images of individual particles 
became more clearly recognizable, suggesting the presence of cage-like 
structures (Fig. 2b, including insets). 

Analysis of thousands of such single-particle TEM images revealed 
the prevalence of two cage projections, one with two-fold symmetry 
and one with five-fold symmetry (Fig. 2c)—too few to allow a 
successful 3D reconstruction. We therefore shifted our attention to cryo- 
electron microscopy (cryo-EM) characterization of the native reaction 
solution. We omitted the silica-surface PEGylation step, as the high 
PEG concentration substantially increased the sensitivity of the samples 
to radiation, resulting in difficulties in obtaining clear cryo-EM images. 

Cryo-EM provided direct visualization of particles in solution with 
arbitrary orientation—that is, without disturbances resulting from 
drying of the samples on TEM substrates, including structure deflation. 
The background noise was much reduced because of the absence of a 
TEM substrate and of chemicals dried onto the substrate during sample 
preparation (Fig. 2c). Although particle aggregation was observed 
occasionally through cryo-EM (Fig. 2d), individual silica nanoparticles 
with cage-like structures could always be identified (Fig. 2c, d). No par- 
ticle aggregation was observed in dry-state TEM of PEGylated particles, 
suggesting that particle aggregation observed by cryo-EM wasa reversible 
process that could be overcome via insertion of PEG chains. 

We manually identified around 19,000 single-particle images from 
cryo-EM micrographs, then clustered them and averaged the images 
in each cluster in order to improve the signal-to-noise ratio”®. The 
averages showed different orientations of silica nanoparticles with 
cage-like structures (silicages; Extended Data Fig. 2a). We identified 
averages that were consistent with selected projections of a pentagonal 
dodecahedral cage (Fig. 2c and Extended Data Fig. 2b). The dodec- 
ahedral silicage (icosahedral point group, I}; Fig. 1) is the simplest of 
a set of Voronoi polyhedra suggested to form the smallest structural 


Fig. 1 | Representations of a dodecahedron. Left, among the platonic 
solids, the dodecahedron best fills out its circumscribed sphere—that 
is, a sphere that passes through all of its vertices. Right, the inscribed 
sphere that passes through all of the dodecahedron’s facets is shown for 
comparison. 
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Fig. 2 | TEM and cryo-EM characterizations of silicages. a, Low- 
magnification TEM images of PEG-coated silicages on a carbon substrate. 
The inset is a zoomed-in image. b, Averaged TEM image obtained using 
11 images of the same sample area of PEG-coated silicages; insets show 
representative individual structures at a higher magnification. The sample 


units of multiple forms of mesoporous silica”!. Although such highly 
symmetrical ultrasmall silica cages have, to our knowledge, never been 
isolated before, it had seemed likely that this should be possible. 

Guided by this structural insight, we carried out single-particle 
3D reconstructions of silicages using the ‘Hetero”” model-based 
maximum likelihood machine-learning algorithm, in which a two- 
class reconstruction is computed to overcome challenges associated 
with structural heterogeneity (see Methods) and rotational icosahedral 
symmetry is imposed on both classes (Extended Data Fig. 2). One of 
the two-class reconstructions was a dodecahedral cage, as visualized by 
UCSF Chimera (ref. **; Fig. 3a, b). We identified a low-intensity signal 
inside this reconstructed cage, consistent with the presence of TMB- 
swollen CTAB micelles within the silicage, whose electron density is 
lower than that of silica but higher than that of the surrounding ice. The 
other class of reconstruction did not provide an interpretable structure, 
probably owing to heterogeneity in the structure of the corresponding 
particles. We carried out such two-class reconstructions using different 
numbers of single-particle images (2,000, 7,000 and 10,000) and found 
consistent results. We further performed single-class reconstructions 
with the Hetero algorithm, using only images from the class that 
showed dodecahedral cages in two-class reconstructions. Equivalent 
two-class and single-class reconstructions were also performed with 
the widely used RELION 2.1 system?*”°. Dodecahedral cage structures 
were obtained in all of these reconstructions (Extended Data Fig. 3 and 
Supplementary Videos 1, 2). The resolution of the reconstructions was 
approximately 2 nm (ref. 7°; Extended Data Fig. 4). Silica in these cages 
is amorphous at the atomic level, which prevented atomic resolution 
of these reconstructions. 

The Hetero reconstruction algorithm provided estimates of the 
projected orientation (that is, three Euler angles) for each experi- 
mental image, which we used to compute predicted projections. Nine 
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Simulated 
projection 


3D model 


was plasma etched for five seconds before TEM characterization to reduce 
background noise. c, Comparison of silicages observed by TEM and 
cryo-EM with projections of simulated dodecahedral cages and models. 
d, Cryo-EM images of silicages without PEG coating. Scale bars in the 
insets of b-d represent 5 nm. 


predicted projections and corresponding experimental images were 
manually clustered, and averages were computed for each cluster 
(Fig. 3c). The similarity of the projections of the 3D reconstruction 
and the averaged experimental images supports the dodecahedral cage 
structure. Furthermore, we calculated the theoretical probabilities 
of finding each of the nine projections (Fig. 3c) on the basis of 
the assumption that the orientations of silicages in cryo-EM are 
random. We then compared the results with the probabilities observed 
by single-particle 3D reconstruction (Extended Data Fig. 5). The high 
consistency between the theoretical and experimental projection 
probabilities further supports the dodecahedral cage reconstruction. 
At this early point we can only speculate about the exact mechanism 
by which the observed silicage structure is formed, but there are clues 
in the details of the reconstruction. The silica occupying the vertices of 
the dodecahedral cage has a diameter of around 2.4 nm (Fig. 3b)—only 
slightly larger than the diameter of primary silica clusters, being approx- 
imately 2 nm (ref. !8; Extended Data Fig. 6). The interstitial spacing 
between two nearby vertices is estimated to be about 1.4 nm (this is the 
length of an edge, 3.8 nm, minus the diameter of the vertices, 2.4 nm; 
see Fig. 3b), which is much less than the diameter of such clusters. 
Bridges between vertices forming the edges of the dodecahedron are 
substantially thinner than the size of the primary clusters (Fig. 3a, b). 
This suggests that negatively charged primary silica clusters formed 
in solution may start to descend onto the positively charged micelle 
surface, attracted by Coulomb interactions. As more and more silica 
clusters assemble on the micelle surface, their repulsive interactions and 
possible interactions with other micelles may cause them to move to the 
vertices of a dodecahedron. Further silane condensation onto the surface 
of growing clusters may lead eventually to bridge formation, resulting 
in the final observed cage structure (Fig. 3). The origin of icosahedral 
symmetry in viruses has been associated with the energy minimization 
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Fig. 3 | Single-particle reconstruction of the dodecahedral silicage. 

a, b, Reconstruction of the dodecahedral silicage (a), and its three most 
unique projections along the two-fold, three-fold and five-fold symmetry 
axes (b). The average dimensions of silicages in b were estimated from the 
reconstructed dodecahedral silicage. c, Representative comparison of nine 


of two opposing interactions: repulsive interactions associated with 
bending rigidity, and attractive hydrophobic interactions”. In a similar 
way, in addition to electrostatic interactions, deformation of the micelle 
surface around the silica clusters may be another important contributor 


Gold 
Fig. 4 | Cage-like structures with different inorganic compositions. 


a-c, Cage-like nanoparticles similar to silicages were obtained when silica 
was replaced by other materials, including gold (a), silver (b) or vanadium 
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unique projections from the reconstruction and cryo-EM cluster averages 
with projections of a 3D dodecahedral cage model. Corresponding single 
cryo-EM images are shown at the bottom, highlighting the difference 
between the raw data and the reconstruction. The scale bar in c represents 
10 nm. The visualizations in a and b were made using UCSF Chimera”? 


to the free energy in our system. This idea is supported by experiments 
showing that the cage structures do not form in the absence of TMB 
(Extended Data Fig. 7), which is expected to enhance the deformability 
of the micelle surface. 


oe #7 3 
Vanadium oxide 


oxide (c). The insets show zoomed-in images of individual particles 
(top two rows) and averaged images”° (bottom row). The scale bars in all 
insets represent 2 nm. 
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We have found that the self-assembly of ultrasmall cage structures 
directed by surfactant micelles can also occur when using other inor- 
ganic materials that have similarly sized features and similar surface 
chemistry to silica. In preliminary experiments, we replaced silica 
with one of two metals, gold or silver, or a transition metal oxide, 
vanadium oxide. Gold and silver structures were prepared by the 
reduction of metal precursors—gold chloride trihydrate and silver 
nitrate, respectively—in the presence of the micelles (Extended Data 
Fig. 8; see Methods). We used tetrakis(hydroxymethyl)phosphonium 
chloride (THPC) as both the reductant and the capping agent to 
stabilize primary gold and silver nanoparticles and to provide negative 
surface charges”*. By contrast, primary vanadium oxide nanoparticles 
with a native negatively charged particle surface were prepared via 
sol-gel chemistry”’, in a similar way to the synthesis of silicages 
(see Methods). Images of individual particles obtained by TEM 
revealed similar internal structures (Fig. 4). These nanoparticles did 
not appear to be dense but instead showed cage-like structures (com- 
pare Figs. 2 and 4), as corroborated by associated projection averages” 
revealing cages with rotational symmetry (bottom insets in Fig. 4), 
similar to the prevalent projection in case of the silicage (vide supra). 
Therefore, micelle-self-assembly-directed cages like the dodecagonal 
structures described here may not be unique to amorphous silica, but 
may provide direct synthesis pathways to crystalline material cages 
(Extended Data Fig. 9). 

There are several ramifications of our silicage discovery. First, such 
cages are generally considered to be individual structural units from 
which larger-scale mesoporous silica is built in a bottom-up manner>!, 
However, given that a dodecahedron cannot be used to generate a tes- 
sellation of 3D space, other silica cage structures must be required. This 
knowledge should motivate better understanding of the early formation 
pathways of surfactant-directed silica self-assembly, including a search 
for micelle-directed ultrasmall silicages with other structures, and from 
other materials (vide supra). Second, the chemical and practical value of 
this polyhedral structure may prove immense. Given the versatility of 
silica surface chemistry, and the ability to distinguish the inside and out- 
side of the cage via micelle-directed synthesis’’, one can readily conceive 
of cage derivatives of many kinds, which may exhibit unusual properties 
and be useful in applications ranging from catalysis to drug delivery. For 
example, given recent success in the clinical translation of ultrasmall 
fluorescent silica nanoparticles with similar particle sizes and surface 
properties to those described here®*’, one can envisage a range of new 
diagnostic and therapeutic probes with drugs hidden inside the cages. 
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METHODS 


Chemicals and materials. All chemicals were used as received. Cetyltrime- 
thylammonium bromide (CTAB), ammonia (2 M in ethanol), mesitylene (1,3,5 
trimethylbenzene; TMB), tetramethyl orthosilicate (TMOS), gold chloride 
trihydrate (HAuCl,-3H20), silver nitrate (AgNOs), tetrakis(hydroxymethyl) 
phosphonium chloride (THPC), dimethyl sulfoxide (DMSO), acetic acid and 
ethanol were purchased from Sigma-Aldrich. Vanadium oxytriisopropoxide 
was purchased from Alfa Aesar. Anhydrous potassium carbonate (K,CO3) was 
purchased from Mallinckrodt. Anhydrous ethanol was purchased from Koptec. 
Silane-modified monofunctional polyethylene glycol (PEG-silane) with a molar 
mass of around 500 g mol (six to nine ethylene glycol units) was purchased from 
Gelest. Carbon-film-coated copper grids for TEM, and C-Flat holey carbon grids 
for cryo-EM, were purchased from Electron Microscopy Sciences. 

Synthesis, TEM and cryo-EM characterization of silicages. Silicages were synthe- 
sized in aqueous solution through surfactant-directed silica condensation. 125 mg 
of CTAB was first dissolved in 10 ml of ammonium hydroxide solution (0.002 M). 
Then, 100 jl TMB was added to expand the CTAB micelle size; the water/CTAB/ 
TMB molar ratio was about 1,620/1/2. The solution was stirred at 600 r.p.m. at 
30°C overnight, and then 100 jul TMOS was added. The reaction was then left 
again at 30°C overnight under stirring at 600 r.p.m. 

To prepare cryo-EM samples, we applied 5 il of the native reaction solution 
to a glow-discharged CF-4/2-2C Protochips C-Flat holey carbon grid, which was 
then blotted using filter paper and plunged into a liquid mixture of 37% ethane 
and 63% propane at —194°C using an Electron Microscopy Science plunge freezer. 
Cryo-EM images were acquired on a FEI Tecnai F20-ST TEM operated at an 
acceleration voltage of 200 kV using a Gatan Orius charge-coupled-device (CCD) 
camera. All cryo-EM images used for reconstruction were acquired at the same 
magnification, with a pixel size of 0.16 nm, and nominal defocus was kept between 
1 pm and 2 um. 

To prepare dry-state TEM samples, we added 100 j:l of PEG-silane into the 
reaction solution. The reaction solution was left at 30°C overnight under stirring 
at 600 r.p.m. to surface-modify silicages covalently with PEGs, in order to improve 
their dispersity on TEM grids. Afterwards, 30 1] of the reaction solution was 
dropped onto a copper grid coated with a continuous carbon film, and blotted 
using filter paper. TEM images were acquired using a FEI Tecnai T12 Spirit 
microscope operated at an acceleration voltage of 120 kV. In order to improve the 
signal-to-noise ratio in recorded images, TEM sample grids were plasma etched 
for 5 seconds before TEM characterization, and a series of images was acquired of 
the same sample area, which were then averaged. 

In order to quench individual primary silica clusters formed at the very 
early stages of cage formation, we added 100 11 of PEG-silane into the reaction 
solution about three minutes after the addition of TMOS. The remainder of the 
procedures—including particle synthesis, dry-state TEM sample preparation, and 
TEM characterization—were as described above. 

Particle purification. In order to remove CTAB and TMB from the cages, after adding 
PEG-silane and stirring at 30°C for a day (see above), we heat-treated the 
solution at 80°C overnight to further enhance the covalent attachment 
of PEG-silane to the silica surface of the silicages. The PEGylated nanocages were 
dialysed (with a molecular weight cut-off, MWCO, of 10 kDa) in a mixture of 
acetic acid, ethanol and water (volume ratio 7/500/500) for three days, and then 
in deionized water for another three days!’. In both cases the dialysis solutions 
were changed once per day. The dry-state TEM sample preparation and TEM 
characterization methods were as described above. 

Synthesis of particles without TMB. The synthesis and TEM characterization 
methods used for particles without TMB were the same as those for particles with 
TMB, as described above, except that the TMB addition step was omitted. 
Silicage surface area and yield of production. We assessed the specific surface 
area of the silicages through a combination of nitrogen sorption measurements 
and theoretical estimations. After the synthesis and purification of PEGylated 
silicages, particles were first upconcentrated using a spin filter (Vivaspin 20, 
MWCO 10 kDa) and dried at 60°C. Particles were then calcined at 550°C for 
6 hours in air. The production yield was then estimated by dividing the remaining 
weight after calcination, that is, the weight of inorganic silica, by the theoretical 
weight of silica, that is, the weight calculated on the basis of the amount of silica 
source added into the synthesis. Nitrogen adsorption and desorption isotherms 
were acquired using a Micromeritics ASAP 2020 (Extended Data Fig. 1c), yielding 
a specific surface area of 570 m” g~! by the Brunauer—Emmett—Teller (BET) 
method. For comparison, using the dodecahedral cage model with the dimensions 
from the reconstruction shown in Fig. 3, we estimated the theoretical surface area 
of silicages to be around 790 m? g~!. The lower experimental value is consistent 
with the expected losses of surface area during sample calcination. 

Synthesis and TEM of metal cage-like structures. The gold and silver cage-like 
structures were prepared by the reduction of metal precursors—HAuCly-3H2O and 
AgNOs, respectively—in the presence of micelles with the same water/CTAB/TMB 
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ratio as for the silicage work. In a typical batch, 50 mg of CTAB was dissolved in 
4 ml of water at 30°C, then 40 jl of TMB and 200 jul of ethanol were added to the 
mixture. After stirring the reaction at 30°C overnight at 600 r.p.m., 16 il of either 
HAuCly-3H20 (25 mM) or AgNO; (25 mM) was added, followed after 5 min by 
8 jl of THPC (68 mM). After another 5 min, 6 iil of potassium carbonate (0.2 M) 
was added. 

Dry-state TEM samples of gold and silver cage-like structures were prepared 

after one day and 6 hours of reaction, respectively, owing to the different reaction 
rates as described in further synthesis Methods sections below. In both cases, the 
samples were prepared by drying 8 1] of the native reaction mixture diluted three 
times in ethanol on a TEM grid in air overnight. In order to remove the thick CTAB 
layer before imaging, the grid was immersed in ethanol for 2 min and then dried 
in air. TEM images of metal cage-like structures were acquired using a FEI Tecnai 
T12 Spirit microscope operated at an acceleration voltage of 120 kV. 
Synthesis and TEM of metal oxide structures. Vanadium oxide cage-like struc- 
tures were prepared on the basis of sol-gel chemistry in a similar way to the 
silicages, using vanadium oxytriisopropoxide as the precursor. In a typical batch, 
50 mg of CTAB was dissolved in 4 ml of water at 30°C, and then 40 jl of TMB were 
added. After stirring the reaction at 30°C overnight at 600 r.p.m., 50 jl of vanadium 
oxytriisopropoxide diluted in 100 sl of DMSO were added. 

Dry-state TEM samples for vanadium oxide cage-like structures were prepared 
after one day of reaction by drying on a TEM grid 8 1] of the native reaction mix- 
ture, diluted ten times in water. At such a dilution, the amount of CTAB was low 
enough that the TEM samples did not require any plasma cleaning or soaking in 
ethanol before imaging. The TEM images of vanadium oxide cages were acquired 
using a FEI Tecnai T12 Spirit microscope operated at an acceleration voltage of 
120 kV. 

Particle reconstruction. We used the ‘Hetero’ model-based maximum likelihood 
machine-learning algorithm”? for particle reconstruction. This algorithm can 
simultaneously estimate: (1) a reconstruction for each type of particle shown in 
the images; (2) the type of particle shown in each image; and (3) the projection 
orientation for each image. Such a joint estimation is a central feature of the algo- 
rithm and is a natural approach for processing data from complicated mixtures. 
The estimates in (2) and (3), which are based on 3D structure, are independent 
of the clustering of 2D images, which is based on pixel values (see, for example, 
Extended Data Fig. 2). In addition to the Hetero algorithm, we applied the widely 
used RELION 2.1 system** to compute equivalent two-class and single-class 
reconstructions. The images were corrected for the contrast transfer function 
(CTE) by phase flipping. 

Further details of metal- and metal-oxide-based cages. In contrast to the sol-gel 
reaction that produces the silicage, synthesis of the gold and silver cage-like structures 
relies on reduction reactions. To this end, we used THPC because it reacts in water 
at basic pH to form trimethoxyphosphine, which can be both the reductant and 
the capping agent for the metal nanoparticles. THPC has been widely used to 
synthesize ultrasmall (less than 3 nm) and negatively charged phosphine-stabilized 
gold nanoparticles”®, These nanoparticles are often used as seeds for the subsequent 
growth of continuous gold shells on the surface of aminated silica nanoparticles 
thanks to their high affinity and binding efficiency with amine groups*!*”. 
Alcohol was added to the reaction mixture in order to mimic the conditions of the 
silicage synthesis, where methanol is formed upon hydrolysis of TMOS. Early-stage 
preliminary experiments showed that the resulting metal structures improved in 
size dispersity when using ethanol at a slightly higher concentration than that of 
the released methanol in the silicage synthesis. Gold and silver cage-like structure 
syntheses were performed at a much lower concentration (silver or gold 
concentration = 93.7 \.M) than that of the silicages (silica precursor 
concentration = 65.9 mM). Attempts at synthesizing gold and silver cages at 
higher concentrations resulted in much larger nanoparticles with no apparent 
internal structure. 

Gold-based synthesis. The addition of gold precursor to the reaction initially 
resulted in the formation of a pale yellow precipitate that turned into a clear (that 
is, non-turbid), darker-orange solution within a couple of minutes under stirring 
at 30°C (see Extended Data Fig. 8 for a survey of the absorption characteristics at 
each step of the synthesis). Given that neither the precipitate nor the darker-orange 
colouration was observed in the absence of CTAB, we attribute these observations 
to some interaction between the gold chloride anions and the ammonium groups 
of the CTAB. After the addition of THPC, the solution turned colourless within a 
couple of minutes, indicating that gold(111) had been reduced to gold(1). The sub- 
sequent transformation of THPC into trimethoxyphosphine, following an increase 
in pH through the addition of potassium carbonate, happened within the first hour 
of reaction (see also the description of silver-based synthesis below). However, the 
reduction from gold(1) to gold(0) was rather slow, with the first hint of colouration 
appearing after 8 hours of reaction. After one day of reaction, the solution exhibited 
a brown colouration. This colouration is the signature of gold nanoparticles that 
are too small or not dense enough to exhibit a strong surface plasmon resonance, 
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as shown by the absorption profile in Extended Data Fig. 8, which shows only a 
faint feature around 510 nm. 

Silver-based synthesis. The addition of silver precursor to the reaction did not 
initially translate into any visible effects, either in the presence of CTAB/TMB 
or after adding THPC. Nevertheless, after adding potassium carbonate to the 
silver-based synthesis, the solution started to turn pale yellow within the first hour 
of reaction and resulted in an intense yellow colouration after 6 hours, at which 
point we prepared the TEM samples. This yellow colouration is typical of small 
silver nanoparticles with a surface plasmon resonance centred around 420 nm 
(Extended Data Fig. 8). 

Vanadium-oxide-based synthesis. The vanadium oxide cage-like nanoparticles 
were prepared under the same conditions as the silicages; however, the pH was not 
adjusted with ammonia owing to the fast hydrolysis and condensation rate of the 
vanadium oxide precursor. Unlike in the synthesis of metal cage-like nanoparticles, 
no alcohol was added here, because the hydrolysis of the vanadium precursor— 
vanadium oxytriisopropoxide—produces alcohol at concentrations similar to those 


used in the silicage synthesis. The addition of this precursor to the TMB micelles 
resulted in the immediate formation of a red precipitate. Under stirring, the 
precipitate dispersed homogeneously in solution, which remained turbid, and 
turned orange after one day of reaction at 30°C. 

Data availability. Figures that have associated raw data are Figs. 2-4 and Extended 
Data Figs. 1-4, 5c and 6-9. There are no restrictions on data availability. The data 
sets generated and analysed during this study are available from the corresponding 
author on reasonable request. 

Code availability. The custom code and algorithm used for the 3D reconstruction 
and related analysis are available from the corresponding author on reasonable 
request. 
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Extended Data Fig. 1 | PEGylated silicages after cleaning, and nitrogen 
sorption measurements on calcined cages. a, b, Representative dry-state 
TEM images, at different magnifications, of PEGylated silicages after the 
removal of surfactant (CTAB) and TMB (see Methods). The insets in a 
(black arrows) and b reveal cage-like structures, suggesting structure 
preservation after the removal of CTAB and TMB. c, Nitrogen adsorption 
and desorption isotherms of calcined silicages. After CTAB and TMB 
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were removed, particles were calcined at 550°C for 6 hours in air before 
nitrogen sorption measurements were taken. A particle synthesis yield of 
67% was estimated from the weight of the calcined powder. The surface 
area of calcined silicages, as assessed by the Brunauer—Emmett—Teller 
(BET) method, was 570 m? g“', consistent with theoretical estimations 
(Methods). Scale bars in the insets in b represent 5 nm. 
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a b Cluster Simulated 3D 
average projection model 
Extended Data Fig. 2 | Cluster averages of two-dimensional images of exhibited features similar to those of simulated projections of 
silicages. a, Around 19,000 single-particle cryo-EM images were sorted dodecahedral cage structure. Also shown are projection models. Scale bars 
into 100 clusters”°. b, Some of the projections (examples highlighted in a) represent 10 nm. 
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Two-fold Three-fold Five-fold 
Extended Data Fig. 3 | Reconstruction of a silicage using the RELION axes. The reconstruction was obtained from a single-class calculation run 
2.1 system. a, Reconstructed dodecahedral silicage”*. b, Its three most by RELION 2.1, using the same set of single-particle images as for the 


unique projections, along the two-fold, three-fold and five-fold symmetry  dodecahedral cage in Fig. 3a. Visualization is by UCSF Chimera”’. 
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Extended Data Fig. 4 | A typical contrast transfer function, and 
determination of reconstruction resolution. We used CTFFIND4.1.8 
(ref. 7°) to estimate defocus for individual micrographs or a set of 
micrographs, with results consistent with nominal defocus values of 
1-2 m. a, Contrast transfer function (CTF) for a defocus of 1.98 jum. 
Given that the first zero-crossing of the CTF occurs at 0.44 nm~!, the 
CTF has little effect on reconstructions unless the resolution is greater 
than 1/(0.44 nm7), that is 2.27 nm. b, A Fourier shell correlation (FSC)** 
computed by a standard package”” for two Hetero reconstructions that are 
independent, starting at the level of separate sets of images each containing 
2,000 images (that is, ‘gold standard’ FSC). The resolution implied by 
the FSC curve (the inverse of the value of spatial frequency where the 
FSC curve first crosses 0.5) is 1/(0.99 nm~'), that is 1.01 nm. c, Energy 
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function for the same pair of reconstructions as in panel b. The energy 

is the spherical average of the squared magnitude of the reciprocal-space 
electron-scattering intensity, where the denominator of FSC is the square 
root of a product of two energy functions, one for each reconstruction. 
The observation that energy has dropped by more than 10~° times its 
peak value, and that the character of the curve has become oscillatory and 
more slowly decreasing—both by 0.44 nm~!—indicates that the resolution 
implied by the FSC curve is exaggerated” and that a more conservative 
resolution is 1/(0.44 nm), that is 2.27 nm. d, FSC computed by a 
standard package” for two RELION 2.1 reconstructions computed from 
the same images as those in panel b, from which the resolution 

(at 0.5 threshold) is estimated to be around 1/(0.50 nm~), that is 2.00 nm. 
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Extended Data Fig. 5 | Probability analysis of silicage projections. 

a, Orientation dependence of silicage projections. Right panel, the nine 
different silicage projections identified by 3D reconstruction (Fig. 3) 

are shown. Left panel, these orientations are manually mapped onto the 
surface of a dodecahedron. The orientations corresponding to different 
projections are assigned different colours. b, Probability analysis for 
different silicage projections. The probability of imaging a particular 
projection by electron microscopy is estimated by dividing that subset 

of the surface area of a sphere that contains the orientations corresponding 
to a specific projection, by the total surface area of the sphere. 

c, Experimental probability of imaging different silicage projections. 

The probability of imaging each projection is calculated by dividing the 
number of single-particle images assigned to a specific silicage projection 
via 3D reconstruction, by the overall number of silicage single-particle 
images. The error bars in c are standard deviations calculated from three 
projection distributions, which were obtained from three independent 
reconstruction runs using different sets of single-particle images. 
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Extended Data Fig. 6 | Size analysis of silica clusters at an early stage of 
cage formation. Particle-size distribution for primary silica clusters at an 
early stage of cage formation, obtained by manually analysing 450 silica 
clusters using a set of TEM images. The measured silica clusters 

were randomly split into three groups, each containing 150 particles. 

A cluster-size distribution was then obtained for each of the three groups, 
and the results were averaged. The error bars are standard deviations 
calculated from the three cluster-size distributions. A representative TEM 
image is included at the right. In order to quench the very early stages of 
cage formation, PEG-silane was added into the synthesis mixture about 
three minutes after the addition of TMOS, thereby PEGylating early 
silica structures. TEM sample preparation and characterization were 

as described in the Methods. Primary silica clusters with diameters of 
around 2 nm were identified, consistent with the proposed cage-formation 
mechanism. 
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Extended Data Fig. 7 | Role of TMB in cage formation. a, b, TEM suggesting that these nanoparticles do not exhibit a hollow cage-like 
images, at different magnifications, of silica nanoparticles synthesized with structure but instead are conventional mesoporous silica nanoparticles 
(a) and without (b) TMB. Nanoparticles synthesized without TMB (b) with relatively small particle sizes. 


show stronger contrast at the particle centres than do the nanocages (a), 
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Extended Data Fig. 8 | Optical characterization of gold- and silver- 
based synthesis solutions. a, Survey of the gold-based synthesis, showing 
the absorption profile of solutions after the successive addition of HAuCl, 
(orange) and THPC (blue) and then one day after the addition of K,CO3 
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(red); also shown is the absorption profile when the same concentration of 
HAuCl, is added to the equivalent water/ethanol solution but without any 
CTAB or TMB (green). b, Absorption profile of a solution obtained from 
the silver synthesis 6 hours after the addition of K,CO3. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Extended Data Fig. 9 | High-resolution TEM image of a single cage-like 
gold nanoparticle. This gold particle exhibits lattice fringes with a spacing 
of 2.3 A, consistent with the known lattice spacing between (111) planes 
of gold (Joint Committee on Powder Diffraction Standards no. 04-0784, 
http://www.icdd.com/). Scale bar represents 5 nm. 
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Enantioselective remote meta-C-H arylation and 
alkylation via a chiral transient mediator 


Hang Shi!, Alastair N. Herron!, Ying Shao!, Qian Shao! & Jin- Quan Yu!* 


Enantioselective carbon-hydrogen (C-H) activation reactions 
by asymmetric metallation could provide new routes for the 
construction of chiral molecules!”. However, current methods 
are typically limited to the formation of five- or six-membered 
metallacycles, thereby preventing the asymmetric functionalization 
of C-H bonds at positions remote to existing functional groups. Here 
we report enantioselective remote C-H activation using a catalytic 
amount of a chiral norbornene as a transient mediator, which 
relays initial ortho-C-H activation to the meta position. This was 
used in the enantioselective meta-C-H arylation of benzylamines, 
as well as the arylation and alkylation of homobenzylamines. The 
enantioselectivities obtained using the chiral transient mediator are 


comparable across different classes of substrates containing either 
neutral o-donor or anionic coordinating groups. This relay strategy 
could provide an alternative means to remote chiral induction, one 
of the most challenging problems in asymmetric catalysis**. 
Enantioselective activation of C(sp?)-H bonds has played 
an important role in the development of chiral ligands and the 
understanding of chiral induction in the metallation of C-H 
bonds. An early example of Ru(o)-catalysed atropselective 
alkylation of 2-arylpyridine® afforded only 49% enantiomeric 
excess (e.e.). Over the past decade, there has been an extensive 
search for suitable metal catalysts, chiral ligands and catalytic 
cycles that can achieve highly enantioselective C-H activation 
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Fig. 1 | Enantioselective C(sp”)-H activation. a, Enantioselective ortho- 
C-H activation. b, Enantioselective remote C-H activation, which is 
currently an unsolved problem. c, A strategy for enantioselective remote 


Pay, COsMe aryl/alkyl 


Desymmetrization: up to 83% yield, 99:1 er 

Kinetic resolution: up to 37 s 
C-H activation. d, The scope of enantioselective remote C-H activation. 
Ar, aryl group; DG, directing group; FG, functional group; [M], transition- 
metal catalyst. 
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Fig. 2 | Enantioselective meta-C-H arylation of diarylmethylamines. 
a, CTM and additive optimization. Reaction conditions: 10 mol% 
Pd(OAc)s, 15 mol% ligand, 50 mol% CTM, 15 mol% additive, 1 

equiv. substrate (R = 3-Me), 3 equiv. methyl 4-iodobenzoate, 3 equiv. 
AgOAc, CHCl, 100°C. Yields reported are based on NMR. b, The 
scope of asymmetric arylation. Isolated yields are reported. The exact 
reaction conditions for each substrate can be found in the Supporting 


reactions. This led to the finding that a Pd(11) catalyst bound to a 
chiral monoprotected amino acid ligand enables highly enantio- 
selective C-H coupling via a Pd(11)/Pd(o0) catalytic cycle®!°, as well 
as the development of diverse transformations involving Pd(11)/Pd(1v) 
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2r (R = CH2NHAc), 70%, 95:5 e.r. 
2s (R = OMe), 67%, 97:3 e.r. 

2t (R = F), 65%, 97:3 e.r. 

2u (R = Cl), 64%, 95:5 e.r. 

2v (R = Br), 73%, 96:4 e.r. 

2w (R = CN), 69%, 90:10 e.r. 


m-Tol 


2ad (R = 2-CO2Me, 5-Cl), 83%, 98:2 e.r. 

2ae (R = 2-NOp, 4-OMe), 70%, 96:4 e.r. 

2af (Ar: B-naphthalene), 66%, 95:5 e.r. 

2ag (R = 3-CF3, 5-CF3), 66%, 99:1 e.r. 

2ah (R = 3-SO2Me, 5-SO2Me), 80%, 96:4 e.r. 
2ai (R = 3-F, 4-F, 5-F), 75%, 96:4 e.r. 


2an, 51%, 90:10 e.r. 


2am, 65%, 89:11 e.r. 2a0, 47%, 84:16 er. 


oS ad ys LOD 


2ar, 66%, 89:11 e.r. 


2as, 40%, >99:1 er. 2at, 70%, 92:8 e.r. 2au, 59%, 94:6 e.r. 


Information. The absolute configuration of 2ah was determined by X-ray 
crystallography. Reducing the Pd(OAc)2 to 5 mol% produced comparable 
results for substrate 1a’ (see Supplementary Table 6). Ad-CO,H, 
1-adamantanecarboxylic acid; Ar(Het), (heterocyclic) aryl group; BNDHP, 
1,1'-binaphthyl-2,2’-diyl hydrogen phosphate; Boc, tert-butyloxycarbonyl 
group; equiv., equivalents; TFA, trifluoroacetic acid; TIPS, triisopropylsilyl 
group; Tol, tolyl group. 


catalysis!” (Fig. 1a). Recently, a number of enantioselective C-H 
activation reactions using chiral Pd(o)!*!4, Rh(1)16, Rh(i1)!7 and 
Ir(1)'8 catalysts have been reported. However, transition-metal- 
catalysed enantioselective remote meta-C(sp”)—H activation has 
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Fig. 3 | Enantioselective meta-C-H activation of homobenzylamines. 
a, The scope of desymmetrization. b, The scope of kinetic resolution. 
The exact reaction conditions for each substrate can be found in 

the Supporting Information. Isolated yields are reported. The absolute 
configurations of 4m and 4n were determined by X-ray crystallography. 


hitherto not been achieved'!**”. Asymmetric metallation of remote 
C(sp”)-H bonds presents a distinct challenge: the metal insertion 
event occurs too far from the carbon that becomes the chiral centre. 
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Reducing the catalyst loading to 5 mol% produced comparable results for 
substrate 3a (see Supplementary Table 10). c, conversion; s, selectivity; 
TBME, tert-butylmethylether; TBS, tert-butyldimethylsilyl group. *Aryl 
or alkyl iodide as coupling reagent. "Methyl 2-bromobenzoate as coupling 
reagent. 


Remote C-H functionalization directed by a nitrile template has been 
developed recently”!; however, control of the stereochemistry through 
a conformationally flexible macrocyclic transition state remains to 
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be demonstrated (Fig. 1b). To address this challenge, we devised a 
strategy using a catalytic amount of chiral transient mediator (CTM) 
to achieve enantioselective remote C-H activation (Fig. 1c). The CIM 
has a dual role: to relay C-H activation from the ortho position to the 
remote meta position”? ~27. and to achieve chiral differentiation of the 
racemic ortho-C-H palladation intermediates, generated by an achiral 
catalyst. The use of chiral norbornene methyl (1S,4R)-bicyclo[2.2.1] 
hept-2-ene-2-carboxylate ((++)- NBE-CO2Me) as the CTM afforded 
high enantioselectivity in remote meta-C-H desymmetrization and 
kinetic resolution (Fig. 1d). 

Our experimental design was based on a previous finding that 
norbornene can intercept the ortho-palladation intermediate via 
migratory insertion and subsequently effect meta-C-H function- 
alizations’*-*°. This reaction mechanism led us to consider that a 
chiral, enantioenriched norbornene might be able to differentiate the 
racemic ortho-palladate intermediate during the alkene insertion, or 
the subsequent alkene-insertion intermediate from the meta-C-H 
palladation step. We selected diarylmethylamine 17°, which contains 
a o-donor coordinating group, as a model substrate to investigate the 
potential asymmetric arylation using chiral norbornene mediators 
(Fig. 2a). Although the chiral bicyclo[2.2.1]heptene derivatives CTM1 
and CTM2 were effective mediators, both gave negligible enantiose- 
lectivity. Likewise, the lactam CTM3 failed to display any degree of 
stereoinduction. The use of chiral (+-)- NBE-CO Me as the transient 
mediator led to a marked improvement in enantioselectivity, giving an 
enantiomeric ratio (e.r.) of 88:12 (see Supplementary Tables 1-6 for 
optimization). Although neither inorganic base nor carboxylic acid 
additives were effective, the phosphoric acid (PhO),PO,H increased 
the enantiomeric ratio to 93:7. With (R)-BNDHP (1,1/-binaphthyl- 
2,2'-diyl hydrogen phosphate) as an additive, a 70% yield and an 
excellent enantioselectivity (97:3 e.r.) were obtained. Bulky substituents 
(such as an anthracenyl group) on the phosphoric acid proved 
detrimental to the reaction. To investigate the role of the chiral 
norbornene and phosphoric acid, we performed a number of 
control experiments. The use of racemic NBE-CO2Me and (R)-BNDHP 
gave poor enantioselectivity (57:43 e.r.). The use of (S)-BNDHP, 
(R)-phosphorylimide or chiral camphorsulfonic acid in combination 
with (+)-NBE-CO2Me led to a slight decrease in yield and enantio 
selectivity. These combined experiments suggest that (+)-NBE- 
CO Me is responsible for the chiral induction and that the chiral 
phosphoric acid has a minor beneficial effect. 

Using the optimized conditions, we then tested the substrate scope 
with methyl 2-iodobenzoate (Fig. 2b). Electron-neutral (2a, 2b), 
electron-donating (2c) and electron-withdrawing (2d to 2f) substituents 
were all well tolerated, providing high enantioselectivities (up to 
97:3 e.x.). Substrate 1g, bearing an alkyne group, gave a lower yield 
owing to the instability of the C=C triple bond under the standard 
conditions. Substituents at the ortho position (2h) reduced the 
reactivity, presumably because ofa steric effect, but high enantioselectivity 
(91:9 e.r.) was preserved. Bis-substituted arenes 2i and 2j, which 
contain heterocycles, were also compatible and consistently high 
enantioselectivity was observed. The five-membered heterocycle 2k 
gave lower selectivity, probably owing to a coordinative effect. 

We next surveyed the scope of aryl iodides (Fig. 2b). The scope of 
ortho-substituted aryl iodides (2m-2gq) is broad, providing high enantio- 
selectivities (up to 97:3 e.r.). Electronically diverse meta-substituted 
aryl iodides (2r-2w) were tolerated, and produced enantiomeric ratios 
of greater than 95:5 (2s—2v). The scope of para-substituted aryl iodides 
is also broad, and electronic factors do not affect the enantioselectivity 
(2x-2ac). Multi-substituted aryl iodides (2ad-2ai) were also toler- 
ated under the conditions, and the enantiomeric ratios all exceeded 
95:5. Among the aryl iodides tolerated were those featuring halogens 
(for example, in 2t-2v) and reactive groups (NHAc in 2m and 2r, 
phosphonate moiety in 2x, alkyl chloride 2z and aldehyde in 2aa) that can 
serve as useful synthetic handles for subsequent chemical manipulation. 
Importantly, heterocyclic aryl iodides also proved successful in the 
reaction (2aj-2au). Pyridine, quinoline, quinazoline, furan, thiophene 
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and indole derivatives were all suitable coupling partners, providing 
good to high enantioselectivities. The simple removal of the directing 
group was demonstrated with product 2ae (see Supplementary 
Information). 

We then investigated the asymmetric meta-C-H activation of 
homobenzylamines”’. To our knowledge, no ortho-C-H activation 
method to date has achieved the construction of a homobenzylic 
chiral centre. Proposing that the challenge might be overcome by the CTM 
strategy, we tested different mediators and found that (+-)-NBE-CO Me 
remained the most effective, giving an enantiomeric ratio of 97:3 with 
nosyl (Ns)-protected homobenzylamine 3a, which contains an anionic 
coordinating group (see Supplementary Tables 7-10). Notably, the use 
of a nosyl-protected amino moiety as the directing group represents 
a practical advantage in synthetic applications. Further investigation 
revealed that this method is compatible with a broad range of symmetric 
homobenzylamines (Fig. 3a). Ortho-, meta-, and para-substituents 
were all well tolerated, and provided the desired products (4a—4f, 4h) 
in good to high enantioselectivities. Moreover, both naphthalene (4g) and 
the bis-substituted arene (4h) were amenable to the standard conditions. 

The reaction tolerated electronically diverse aryl iodides (4i to 41) 
and provided enantiomeric ratios of up to 99:1. Reactions involving het- 
erocyclic aryl iodides such as pyridine, furan and thiophene (4m to 40) 
were also successful. In addition to aryl iodides, 2-bromobenzoate was 
also tested and product 4a was obtained in moderate yield and excellent 
enantioselectivity (>99:1 e.r.). Similarly, the aliphatic coupling partner 
iodoacetate gave product 4p with an enantiomeric ratio of 92:8. 

In addition to desymmetrization, the kinetic resolution of second- 
ary amines was also achieved (Fig. 3b). (-+)- NBE-CO2Me successfully 
resolved a racemic mixture of amino esters; with 2-bromobenzoate, 
meta-arylated products (4q, 4r) were obtained in good enantio- 
selectivity along with the recovery of starting materials. These meta- 
arylated phenylalanines are central motifs in a family of bioactive 
natural products*”. Alkyl- and aryl-substituted homobenzylamines 
(4s-4u) were also compatible with this system. Moreover, simple 
arene (4v), heterocycle (4w) and alkyl groups (4x, 4y) were all 
enantioselectively introduced at the meta position by this method. 
To demonstrate the synthetic utility of this reaction, a copper- 
catalysed intramolecular amination of 4z yielded chiral indoline 5 
with the 5 position substituted, a core structure of the chemotherapy 
medication vincristine. 

In summary, remote enantioselective C-H activation reactions were 
realized by relaying ortho-C-H activation to remote meta-C-H activation 
using a chiral norbornene as the mediator. The chiral amplification is 
achieved by fast, reversible, non-asymmetric C-H activation followed 
by enantioselective norbornene insertion or meta-C-H activation. 
This approach is compatible with substrates containing either neutral 
o-donor or anionic coordinating groups. 
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METHODS 


Mechanistic studies. Our previous mechanistic studies of enantioselective C-H 
activation reactions established the asymmetric C-H metallation step as the 
enantiodetermining step>!?. Because the chiral mediator (++)-NBE-CO2Me 
is not involved in the ortho-C-H activation step, chiral differentiation should 
occur in one of the subsequent steps. A number of mechanistic experiments were 
therefore carried out to provide further insights (see Supplementary Figs. 1-8). 
First, kinetic isotope effects of 1.03 and 1.33 for the ortho- and meta-deuterated 
substrates, respectively, indicated that the C-H activation steps are probably not 
rate-limiting. The first-order rate dependence observed for aryl iodide points to 
the oxidative addition of aryl iodide as rate-limiting (see Supplementary Fig. 7)°°. 
Second, substantial hydrogen—deuterium exchange at the ortho position and the 
lack of such exchange at the meta position suggests that the ortho-C-H activation 
step is fast and reversible whereas the meta-C-H activation is not readily reversible 
on the timescale of the reaction. These combined data suggest that chiral differenti- 
ation by the chiral norbornene occurs either during the norbornene insertion into 
the ortho-palladation intermediate or during the subsequent meta-C-H activation 
step (see Supplementary Fig. 9). 

General procedure for the Pd/(+)-NBE-CO Me catalysed enantioselective 
meta-C-H activation. Arene (0.10 mmol, 1.0 equiv), Pd(OAc), (2.2 mg, 10j1mol, 
10 mol%), ligand (15 mol% or 20 mol%), hydrogen phosphate (15 mol% for 
benzylamine substrate), aryl iodide (0.3 mmol, 3.0 equiv.) and silver acetate 
(50 mg, 0.30 mmol, 3.0 equiv.) were added into a 2-dram reaction vial. Solvent 


and (+)-NBE-CO2Me (20 mol% or 50 mol%) were added to the mixture. The 
vial was flushed with N» and capped. The reaction mixture was then stirred at 
the selected temperature for 12-24h. After cooling to room temperature, the 
mixture was filtered through Celite and eluted with ethyl acetate. The filtrate 
was evaporated under reduced pressure. Purification by preparative thin-layer 
chromatography afforded the desired product. Full experimental details 
and characterization of new compounds can be found in the Supplementary 
Information. 

Data availability. The data supporting the findings of this study are available 
within the article and its Supplementary Information Files. Metrical parameters for 
the structures of 2ah, 4m and 4n are available free of charge from the Cambridge 
Crystallographic Data Centre (https://www.ccdc.cam.ac.uk/) under reference 
numbers 1586807, 1586808 and 1586809, respectively. 
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crystallization and crust formation on Mars 
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The formation of a primordial crust is a critical step in the evolution 
of terrestrial planets but the timing of this process is poorly 
understood. The mineral zircon is a powerful tool for constraining 
crust formation because it can be accurately dated with the uranium- 
to-lead (U-Pb) isotopic decay system and is resistant to subsequent 
alteration. Moreover, given the high concentration of hafnium in 
zircon, the lutetium-to-hafnium (!”°Lu-!”°Hf) isotopic decay system 
can be used to determine the nature and formation timescale of its 
source reservoir’ *. Ancient igneous zircons with crystallization ages 
of around 4,430 million years (Myr) have been reported in Martian 
meteorites that are believed to represent regolith breccias from the 
southern highlands of Mars*°. These zircons are present in evolved 
lithologies interpreted to reflect re-melted primary Martian crust’, 
thereby potentially providing insight into early crustal evolution 
on Mars. Here, we report concomitant high-precision U-Pb ages 
and Hf-isotope compositions of ancient zircons from the NWA 7034 
Martian regolith breccia. Seven zircons with mostly concordant 
U-Pb ages define 7°’ Pb/?°Pb dates ranging from 4,476.3 + 0.9 Myr 
ago to 4,429.7 + 1.0 Myr ago, including the oldest directly dated 
material from Mars. All zircons record unradiogenic initial Hf- 
isotope compositions inherited from an enriched, andesitic-like 
crust extracted from a primitive mantle no later than 4,547 Myr ago. 
Thus, a primordial crust existed on Mars by this time and survived 
for around 100 Myr before it was reworked, possibly by impacts*®, 
to produce magmas from which the zircons crystallized. Given 
that formation of a stable primordial crust is the end product of 
planetary differentiation, our data require that the accretion, core 
formation and magma ocean crystallization on Mars were completed 
less than 20 Myr after the formation of the Solar System. These 
timescales support models that suggest extremely rapid magma 
ocean crystallization leading to a gravitationally unstable stratified 
mantle, which subsequently overturns, resulting in decompression 
melting of rising cumulates and production of a primordial basaltic 
to andesitic crust®”. 

The emergence of a stable primordial crust is a fundamental step in 
the early history of rocky, potentially habitable planets. Primordial crust 
formation is the end product of a long sequence of events, including 
planetary accretion, establishment of a global magma ocean, core for- 
mation and, finally, silicate differentiation. Existing constraints® ” for 
the timing of each of these events allow for primordial crust formation 
in terrestrial planets over timescales of approximately 5 to 100 Myr, a 
range that precludes a full understanding of early planet formation. 
In the Solar System, Mars offers the opportunity to better constrain 
the timing of planet-formation processes, given its relatively simple 
geologic history as a stranded planetary embryo? as well as the wealth 
of information from Martian meteorites and spacecraft exploration”. 
From the meteorite record, the accretion of Mars is inferred to have 


been mostly completed within about 5 Myr of the formation of the Solar 
System®!4, whereas the crystallization of the magma ocean leading to 
the extraction of a primordial crust may have occurred over timescales 
of approximately 30 to 100 Myr after accretion!®!>-!°. However, these 
timescales for silicate differentiation are based on the modelled abun- 
dances of the short-lived '*’Hf and '“°Sm nuclides during planetary 
differentiation inferred from young Martian meteorites and, hence, 
are highly model dependent. 

A more robust approach to dating early planetary differentiation on 
Mars requires the identification of material that formed in the earliest 
evolutionary stages of the planet. On Earth, such a record is preserved 
in the Jack Hills of Western Australia, which contains ancient zircons!” 
as old as about 4,370 Myr. Although zircon is not common in Martian 
meteorites, two recent studies have reported the presence of approx- 
imately 4,430-Myr-old zircons in the Martian regolith breccia NWA 
7533/7034, which is thought to have originated from the southern high- 
lands of Mars*?. The breccia contains clasts that are interpreted to be 
of igneous, sedimentary and impact origin, preserved in a fine-grained 
matrix. Zircons have been identified in the igneous and sedimentary 
clasts as well as within the matrix. Collectively, these grains are likely 
to provide the earliest tangible record of crust formation processes on 
Mars. However, the typical (small) sizes of these zircons prevent us 
from obtaining the concomitant U-Pb ages and Hf isotope composi- 
tions of sufficient precision using in situ techniques. 

We conducted a systematic search for zircons in a bulk crushed 
rock aliquot of the NWA 7034 meteorite. Although this approach does 
not provide a petrological context for individual zircons, it is the only 
means of ensuring the recovery of grains sufficiently large (>30 um) 
to permit high-precision U-Pb chronology and Hf isotope measure- 
ments using solution-based methods. Irrespective of their petrological 
context, these zircons faithfully record information about the nature of 
their source reservoir. This rationale is recognized in studies using the 
Jack Hills detrital zircons to probe the early terrestrial crustal record'”. 
A total of seven grains were extracted and analysed for U-Pb dating and 
Lu-Hf systematics. Their sizes ranged from around 50 j1m to 110 pm 
and they were found to represent different morphologic types, includ- 
ing rounded pieces, irregular anhedral pieces, and euhedral pieces with 
well-defined faces and a flat prismatic shape (Extended Data Fig. 1). 
Common to all of them was the general absence of fractures and inclu- 
sions, as well as any evidence of radiation damage. The zircons returned 
2°7Pb/?°Pb ages ranging from 4,476.3 +E 0.9 Myr to 4,429.7 + 1.0 Myr 
(Table 1). Importantly, five of the seven grains are concordant within 
their stated uncertainties, grain S22b5 is 1.2% discordant and grain 
S24B2 is 5.3% discordant (Fig. 1). The larger degree of discordance 
for grain $24B2 is consistent with its higher U content of 46 parts per 
million (p.p.m.) relative to the other zircons we investigated. On the 
basis of textural information and geological context, the zircons from 
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Table 1 | U-Pb age data and ?76Lu-17Hf systematics of NWA 7034 zircons and the Hf isotope composition of 


the 91500 terrestrial zircon standard 


207Pp/206Ph 207Pp/235U) 206Pph/238y 

Sample age (Myr) age (Myr) age (Myr) Ve_y/77Ht = 176HF/177HE 178Hf/177HF 180H£/177HfE eHfr 

$22b4 4,448.74 1.8 4,445.1+63 4,437.34 22.1 0.000805 0.279891 + 10 1.467184 2 1.88666 + 6 —1.92 + 0.37 
$22b5 44742414 4,457.84 5.7 442194198 0.000799 0.279907 + 09 1.46719+2 1.88666 + 5 —0.71 40.32 
S23b9 4,447.04 1.5 44410448 4,427.7 + 16.5 0.000911 0.279908 + 16 14672142 1.88669 + 6 1.70 + 0.58 
S24b2 4,429.74+1.0 4,355.2 4+ 43 4,195.9 + 12.8 0.001057 0.279922 + 06 1.46719+2 1.88667 + 3 2.06 + 0.26 
S24b4 4,474.0 +08 4,470.1 +29 44614499 0.001055 0.279906 + 06 1.46719+2 1.88667 + 3 1.57 40.26 
S24b7 4,473.9+09 4,476.8 +28 4,483.3 49.5 0.000742 0.279887 + 05 1.4672041 1.88667 + 3 1,27 40.19 
$25b10 4,476.3 +09 4,474.5 + 3.6 4,470.6 + 12.5 0.001191 0.279926 + 07 1.46721 42 1.88669 + 4 1.214030 
Average 1.467204 2 1.88667 + 3 

91500-1 0.282308 + 06 14672141 1.88670 + 3 

91500-2 0.2823144 11 14671842 1.886744 4 

91500-3 0.282311 + 06 14671842 1.88670 +4 

91500-4 0.282311 +05 14671841 1.88668 + 2 

91500-5 0.282309 + 05 14671941 1.88669 + 2 

91500-6 0.282309 + 06 14671841 1.88664 + 2 

91500-7 0.282317 +05 1.46718+1 1.88667 + 3 

Average 0.282311 + 06 1.46718 + 2 1.88669 + 6 


Age uncertainties are 2c. Hf isotope ratios are reported normalized to the composition of the JMC-475 Hf standard. Uncertainties on the Hf isotope ratios reflect the 2-standard-error internal precision 
in the last decimal places. The external reproducibility of the !7°Hf/!77Hf ratio is estimated to be 22 p.p.m., based on the analyses of the seven 91500 zircon aliquots. U-Pb data are reported in full in 


Supplementary Table 1. 


the basaltic breccia NWA 7034/7533 have been interpreted as igneous 
in origin*». Both the ages and the morphologies of the zircons ana- 
lysed here are in line with this interpretation. Although the range of 
ages we report is consistent with earlier studies, the better than tenfold 
improvement in precision allows us to establish that zircon formation 
occurred in multiple igneous events over around 50 Myr. Four zircons 
analysed here define an age cluster at about 4,475 Myr (Fig. 1), which 
is considerably older than the age of about 4,430 Myr inferred from 
earlier studies of Martian zircons. In particular, one concordant zircon 
(S25B10) from this population records an age of 4,476.3 + 0.9 Myr and, 
as such, represents the oldest directly dated material from Mars. This 
age is about 100 Myr older than the oldest dated terrestrial zircons’”, 
implying that the record of crust-forming processes on Mars is clearly 
older than that on Earth. These zircons thus provide a unique insight 
into the earliest history of the planet. 

The distinct geochemical behaviour of Lu and Hf during partial 
melting episodes makes the '”°Lu-!”°Hf decay system a powerful tool 
with which to constrain the timing of planetary silicate differentiation. 
For example, a primordial crustal reservoir will inherit a sub-chondritic 
761 u/!’7Hf ratio (<0.0336, ref. 18) such that its time-integrated Hf 
isotopic composition will be less radiogenic than the chondritic 
uniform reservoir. We have determined the Hf isotope composition 
and Lu/Hf ratios of the seven U-Pb dated zircons using multiple 
collection inductively coupled plasma source mass spectrometry 
(MC-ICPMS)!?”°. The more than fourfold improvement in precision 
afforded by this method over in situ techniques (see, for example, 
ref, *') is required to probe early differentiation timescales, given the 
small amount of radiogenic ingrowth of !”°Hf in the first 100 Myr 
of the Solar System. Moreover, this approach guarantees that the Hf 
isotope compositions were measured for the same volume of zircon 
for which the ages were determined, thereby ensuring that the initial 
Hf isotope compositions are accurately time-corrected. Importantly, 
the age variability of about 50 Myr recorded by the zircons 
allows us to track the isotopic evolution of their source reservoir. 
The seven zircons have unradiogenic initial eHf values ranging from 
-0.71 + 0.32 to -2.06 + 0.26 (Table 1; the cHf value is the devia- 
tion of the '”°Hf/!”’Hf ratio of a sample from the chondritic uniform 
reservoir, in parts per 10%), indicating that these grains formed from a 
precursor reservoir with a sub-chondritic !”°Lu/'”’Hf ratio (Fig. 2a). 
The initial eHf values are correlated with their 7°’Pb/?°°Pb ages, sug- 
gesting that these grains were ultimately derived from a common 
source reservoir. The stable '7Hf/'”’Hf and '®°Hf/'77Hf ratios of 
the NWA 7034 zircons are identical to those of terrestrial zircons 
(Table 1), establishing that the reported cHf values are unaffected by 
neutron-capture effects. 


The minimum model formation age for the source reservoir of the 
zircon population investigated here can be estimated from the oldest 
grain that records the most unradiogenic initial Hf isotope compo- 
sition. Three of the four zircons that define the age cluster at about 
4,475 Myr ago have concordant U-Pb ages and, as such, are inferred to 
have undisturbed !”*Lu-!”*Hf systematics. Indeed, these three zircons 
(S24b4, $24b7 and $25b10) record identical initial ceHf values within 
uncertainty despite having different !”°Lu/!”’Hf ratios. Therefore, the 
average initial ¢Hf value of these three concordant grains (-1.35 + 0.22) 
provides a robust estimate of the Hf isotope composition of their 
protolith at around 4,475 Myr ago and, hence, the formation age of 
their source crustal reservoir. Given that the available geochemical 
data suggest that the bulk of the exposed Martian crust is of basaltic 
composition!*2, we assess whether this type of crust could represent the 
source reservoir of the ancient zircon population. Using a !”*Lu/!”7Hf 
ratio typical of mafic crustal sources on Earth (0.020, ref. 7°) returns 
impossibly old ages (>4,567 Myr), requiring the existence of a crustal 
reservoir with a composition more evolved than basaltic to account 
for the initial cHf value of the approximately 4,475-Myr-old zircons. 
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Fig. 1 | U-Pb concordia diagram for seven zircon grains from the 

NWA 7034 meteorite. Labels on the concordia curve represent time 
before present. Data point error ellipses are 2c. Data used in this figure are 
reported in full in Supplementary Table 1. 
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Fig. 2 | Hf isotope evolution diagrams. a, The initial ¢Hf values (¢Hfr) for 
the seven individual NWA 7034 zircons calculated with their corresponding 
207Pb—*Pb ages using a A!”°Lu value of 1.867 + 0.008 x 107"! yr7! (ref. 3°) 
and chondritic uniform reservoir parameters of ref. '*. The upper boundary 
of the forbidden region represents a reservoir with !”Lu/!””7Hf=0 anda 
formation age defined by the age of the Solar System?! at 4,567 Myr such that 
no data should plot in this field. b, The time evolution of basaltic and 
andesitic crustal reservoirs required to account for the average initial Hf 
isotope compositions of the three concordant 4,475-Myr-old zircons (S24b4, 
$24b7 and $25b10) using !”°Lu/!”’Hf ratios of 0.020 and 0.011 for the 
basaltic and andesitic crusts, respectively”**°. Considering the upper 
uncertainty of the zircon average cHfr value (-1.35 + 0.22), it is not possible 
to account for the initial Hf isotope composition of these grains if they 
formed from the reworking of a basaltic crust, because extraction ages older 
than the Solar System are required. By contrast, using a more evolved, 
andesite-like '”°Lu/'”’Hf ratio returns a minimum extraction age of 

4,547 Myr. Using the mean of the concordant grains at face value and a 
176L u/*77Hf ratio of 0.011 yields an extraction age of 4, 562*?, Myr. We note 
that the time evolution of this reservoir can account for the Hf isotope 
composition of the younger, approximately 4,450-Myr-old and 4,430-Myr- 
old zircons. Indeed, a regression of the mean of the approximately 
4,475-Myr-old, 4,450-Myr-old and 4,430-Myr-old zircons yields a slope 
corresponding to an andesite-like !”°Lu/'7’Hf ratio of 0.011. Uncertainty on 
the cHf values reflects the internal precision (2 standard errors) or the 
external reproducibility of 22 p.p.m., whichever is larger. Uncertainty on the 
2°7Pb/?SPb ages (2c) are smaller than symbols. 


One possibility is an andesite-like source, because rocks with such 
evolved compositions have been identified on Mars based on in situ 
observations”*. Moreover, some magma ocean crystallization models®” 
predict basaltic-to-andesitic compositions for the primary Martian 
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crust produced by decompression melting of rising cumulates, follow- 
ing overturn of the gravitationally unstable stratified mantle. We note 
that recent estimates that suggest a low crustal bulk density for Mars 
(2,582 + 209 kg m7’, ref. 7°) could, in principle, also be consistent with 
a more evolved average crustal composition. Using an andesite-like 
SL u/!”’Hf ratio of about 0.011 estimated from terrestrial rocks”*® 
defines a minimum formation age of 4,547 Myr for the source reservoir. 
An andesite-like composition for the nature of this source reservoir 
is further reinforced by the observation that the initial Hf isotope 
compositions of the approximately 4,450-Myr-old and 4,430-Myr-old 
zircons are also consistent with extraction from the same source 
(Fig. 2b). Indeed, taking the data at face value for each of the age groups 
returns a slope corresponding to an andesite-like !”°Lu/!”’Hf ratio of 
about 0.011. The fact that more evolved compositions for a primor- 
dial crust are not predicted by any model provides confidence that 
the formation age of this reservoir cannot be younger than 4,547 Myr. 
Therefore, this minimum age for the source of the NWA 7034 zircons 
represents the oldest differentiated silicate reservoir yet identified on 
Mars. Ancient Martian zircons with ages comparable to that reported 
here have been identified in igneous, evolved lithologies that are inter- 
preted to reflect re-melted primary Martian crust*. The enriched com- 
position for the NWA 7034 zircon source reservoir inferred from the 
andesite-like !”°Lu/1”’Hf ratio is consistent with this interpretation. 
Thus, our data require that a primordial crust existed on Mars by 4,547 
Myr ago and that it survived for about 100 Myr before it was reworked 
to produce magmas, possibly by impacts**, from which the NWA 7034 
zircons crystallized. We infer that this primordial crust represents a 
global reservoir given its longevity and the extended period of reworking 
indicated by the zircon data. 

The new timescales reported here for stabilization of the primordial 
Martian crust have far-reaching implications for understanding the 
accretion and differentiation history of Mars. Given that the formation 
of a stable primordial crust is the end product of the initial planetary dif- 
ferentiation, our data require that accretion, core formation and magma 
ocean crystallization on Mars was completed within 20 Myr of the for- 
mation of the Solar System. Such short timescales for primary accretion 
are predicted by planet formation models invoking pebble accretion 
where growth is fuelled by the gas-drag-assisted accretion of millimetre- 
sized objects, which leads to the efficient formation of Mars-sized 
embryos within the approximately 5-Myr lifetime of the protoplanetary 
disk?’°. Moreover, these timescales are also consistent with estimates 
based on the short-lived '’Hf-'®’W decay system for the timing of core 
formation, which is inferred to have occurred within 10 Myr of the for- 
mation of the Solar System”. By contrast, some recent studies have sug- 
gested that magma ocean crystallization was protracted on Mars, perhaps 
lasting up to 100 Myr after differentiation of the planet, on the basis of 
model ages deduced from the abundances of short-lived radionuclides in 
young Martian meteorites'®'*®, Such a protracted magma ocean crys- 
tallization is inconsistent with the data presented here and with thermal 
models suggesting that the solidification history of Mars must have been 
completed within about 10 Myr of accretion’. As such, the timing of 
silicate differentiation inferred from short-lived radionuclides in young 
Martian meteorites may not reflect primary differentiation of the planet 
but rather a younger, mantle-scale, fractionation event. Finally, our data 
and interpretation are fully consistent with models suggesting rapid 
magma ocean crystallization leading to a gravitationally unstable strati- 
fied mantle, which subsequently overturns, resulting in decompression 
melting of rising cumulates and extraction of a primordial basaltic-to- 
andesitic crust’. The extensive resurfacing of Mars by volcanism over 
the planet's history suggests that if any primordial crust is preserved, it 
will be deeply buried and may only be exposed in deep craters. 
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METHODS 
A total of seven zircon grains were extracted from a crushed bulk rock aliquot 
of NWA 7034 and analysed for U-Pb and Lu-Hf systematics. Given the limited 
number of zircons recovered from the crushing process and their small sizes, only 
one of the larger grains (S25b10) was chemically abraded’. This pre-treatment 
consisted of thermally annealing the less metamict domains of the crystal over 
three days at 900°C, followed by dissolution of the un-annealed portions using 
concentrated HF for 12 hat 180°C in Teflon capsules. Before complete dissolution, 
all zircon grains were cleaned in Pyrex beakers in an ultrasonic bath with alternat- 
ing steps of warm 3.5 M HNOs, HO and acetone. As we assumed that each grain 
might represent a different age, the grains were processed as single grains. The 
individual crystals were dissolved in separate PFE Teflon capsules in a HF:HNO3 
(3:1) mixture, together with the mixed *’Pb-”Pb-?U-*°U EARTHTIME U-Pb 
tracer*, for four days at 210 °C. The dissolved samples were evaporated to dryness 
and redissolved in 3.1 M HCl overnight. Uranium and lead were separated from 
the matrix elements by anion chromatography using 50-11 Teflon columns**° and 
dried down together with 8 11 of 0.1 M H3POq. They were loaded with silica gel*° 
on previously outgassed zone-refined Re filaments. The Pb and U isotopic ratios 
of the sample + tracer mixture were measured using the Triton Thermo-Fisher 
thermal ionization mass spectrometer at the Centre for Star and Planet Formation, 
University of Copenhagen, where each isotope was sequentially counted in a single 
axial ion counting system with Pb as Pbt, and U as UO*". The data were reduced 
offline and instrumental mass fractionation was accounted for by a linear mass- 
dependent fractionation law based on the Pb/?Pb ratio of the tracer. After 1 pg of 
laboratory Pb blank was removed from the analyses, the remainder of common Pb 
was assumed to have an isotopic composition, modelled after ref. °”. Instrumental 
mass-dependent fractionation of U was accounted for using the 7¥U/?*°U ratio of 
the tracer, which included a correction for the isobaric interference of 777U!°O!8O 
on the 7°U!°O, peak at mass 267. The °U/?°U ratio of mass 137.88 (ref. °°) and 
the #8U and *°U decay constants of ref. *° were used for calculation of U/Pb ratios 
and ages. All ratios and age uncertainties are quoted at the 95% confidence level. 
The Hf isotope composition and Lu/Hf ratios of individual zircons were deter- 
mined from the same sample digestion as that used for U-Pb age determination. 
Following collection of the high-field-strength element and rare-earth element 
(REE) washes from the U-Pb purification, approximately 5% of the solution was 
aliquoted for Lu/Hf ratio determination. The Hf was purified from the remaining 
solution by a two-step procedure using TEVA-spec and TODGA resins (Eichrom 
Industries) based on protocols outlined in refs 19 and 20. Zr was quantitatively 
separated from Hf, Ti and REE using 100-150 jum TEVA-spec resin in a 120-11 
column. The fractions containing high-field-strength elements and REE were 
loaded in 0.6 ml of 10.5M HCl anda Hf + Ti + REE cut was successively collected 
with 3.6 ml of 10.5 M HCl and 4.2 ml of 9.5 M HCL. Zr was recovered with 3.0 ml of 
6M HCL. Hf was purified from Ti and REE using 50-100 jum TODGA resin in 200-1l 
columns. The Hf + Ti+ REE fractions were loaded in 0.85 ml of 3.5 M HNO3- 
0.06 M boric acid and the Ti was eluted by adding 3.5 ml of 3.5 M HNO; while Hf 
and REE remained on the column. Hf was subsequently collected with 6 ml of 
1M HNO;-0.35 M HE This method returns Hf yields greater than 95% in Hf cuts 
with Zr/Hf ratios below 1. Hf isotope ratios were measured on the Pandora 
Thermo-Fisher Neptune Plus MC-ICPMS at the Centre for Star and Planet 
Formation, University of Copenhagen, using a sample-standard bracketing tech- 
nique. Samples were aspirated into the plasma source in 2% HNO3-0.1M HF 
solution via a Cetac Aridus II desolvating nebulizer using Ar and N2 sweep gases 
with an uptake rate of about 100 jl min™. Typical sensitivity at this uptake rate 
was 1,800 V per p.p.m. for Hf. Hafnium isotopic data were acquired in static mode 
using eight Faraday collectors allowing for simultaneous measurement of !”°Hf, 
HF 178HF HF and |®°Hf as well as monitoring potential isobaric interferences 
(76Yb on 1”°Hf, !”°Lu on !”SHf and 18°W on !®°Hf) using !”°Lu, !7!Yb and 1?W. 
Faraday detectors used to collect Hf isotopes and !®W were connected to amplifiers 
with 10!!-O feedback resistors, whereas !”°Lu and !71Yb were measured on Faraday 
detectors connected to amplifiers with 1013-0 feedback resistors. Sample analyses 
were interspersed with analyses of the JMC-475 standard as follows: JMC-475 (1), 
JMC-475 (2), sample-1, JMC-475 (3), JMC-475 (4). Samples and standards were 
analysed with a signal intensity of at least 1 V on mass '””Hf and ensuring that the 
signal intensity of the sample and standard were matched to within 5%. Samples 
were analysed once and the total amount of Hf consumed per analysis was typically 
2-5 ng for NWA 7034 zircons. Total procedural blanks were <10 pg for Hf, an 
amount that is negligible for all samples considering the amount of Hf available for 
analysis. All data reduction was conducted off-line using the freely available Iolite 
data reduction software” that runs within Igor Pro. Background intensities were 
interpolated using a smoothed cubic spline, as were changes in mass bias with time. 
Hf isotope data were corrected for mass bias using the exponential mass fraction- 
ation law, adopting !”Hf/!””Hf = 0.7325. The sample !”°Hf/!””Hf, !4Hf/!””Hf and 
18H £/!77Hf ratios were normalized to the JMC-475 reference values of 0.282160, 
1.46717 and 1.88667, respectively. Contributions from interfering species were on 


average 33 p.p.m. and 1 p.p.m. on '”°Hf/!””Hf from Yb and Lu, respectively, and 
26 p.p.m. on °°Hf/!”7Hf from W. Doping experiments with Yb interference levels 
at least ten times greater than typically observed in our sample demonstrate that 
our interference correction is accurate. The accuracy and external reproducibil- 
ity of our method was assessed by repeat analyses of the 91500 zircon reference 
standard*". In detail, seven aliquots of the same sample dissolution each containing 
approximately 5 ng of Hf were individually processed through our U-Pb and Hf 
purification scheme and analysed following the methods described above. The 
average values obtained for the '”°Hf/!””Hf, !”SHf/!””Hf and !°Hf/!””Hf ratios of 
the 91500 standard aliquots were 0.282311 + 0.000006, 1.46718 + 0.00002 and 
1.88669 + 0.00006, where the uncertainty represents the external reproducibility 
(2 standard deviations) (Table 1). The data we obtained for the 91500 zircon 
reference standard are identical to published values”. 

The Lu/Hf ratios were determined using the same MC-ICPMS as used for the 
Hf isotopic measurements using gravimetrically prepared mixed Lu—Hf standard 
solutions. In detail, the approximately 5% aliquot reserved for Lu/Hf ratio deter- 
mination was evaporated and re-dissolved in 2% HNO3-0.1 M HF prior to analysis. 
Samples were aspirated into the plasma source via a Cetac Aridus II desolvating 
nebulizer using Ar and N> sweep gases with an uptake rate of about 100 yl min“. 
The !”°Lu beam was collected on the axial secondary electron multiplier, whereas 
the !””Hf was collected on the H2 Faraday detector connected to an amplifier with 
a 10!!-( feedback resistor. Sample analyses were interspaced by analyses of the 
calibrated Lu-Hf standard solution as follows: Lu-Hf standard (1), Lu-Hf standard 
(2), sample-1, Lu-Hf standard (3), Lu—-Hf standard (4). Samples and standard were 
analysed with an intensity of at least 0.05 V on mass !’””Hf and 100.000 counts per 
second on mass !”*Lu, ensuring that the signal intensity of the sample and standard 
were matched to within about 5%. Total procedural blanks were <5 fg for Lu and 
negligible for all samples considering the amount of Lu available for analysis. The 
Lu-Hf standard solution was prepared gravimetrically to match the typical Lu/Hf 
ratio of zircon and is accurate to 2%. The external reproducibility of our approach 
was estimated by repeated analysis of the 91500 zircon standard. Analysis of ten 
individual aliquots of a single dissolution of the 91500 zircon standard yielded 
a !6L.u/!”7Hf ratio of 0.00030346 + 0.0000016 (2 standard deviations), which 
corresponds to an external reproducibility of 0.5% for the Lu/Hf ratio. Potential 
fractionation of the Lu/Hf ratio induced by U-Pb purification was evaluated by 
measuring the Lu/Hf ratios of aliquots of the 91500 zircon standard before and 
after U-Pb purification. Our tests demonstrate that potential fractionation of the 
Lu/Hf during U-Pb purification is less than 0.4%. Combining this with the uncer- 
tainty of the Lu-Hf standard solution, the external reproducibility of 0.5% and the 
potential fractionation of 0.4%, we infer an accuracy of 2.1% for our Lu/Hf ratio 
measurements. This represents the total uncertainty on the Lu/Hf ratio reported 
here and has been propagated in the final uncertainties quoted for the initial Hf 
isotope composition of the NWA 7034 zircons. 

Data availability. The authors declare that data supporting the findings of this 
study are available within the paper and the Methods. All other data are available 
from the corresponding author upon reasonable request. 
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Extended Data Fig. 1 | Photomicrographs of the NWA 7034 zircons 
analysed in this study taken under natural light. Given the small size 
and limited number of zircons recovered from the crushing process, we 
considered it to be preferable not to conduct additional imaging (using 
cathodoluminescence) because this necessitates extra manipulation of the 


individual grains, thereby increasing the risk of losing zircons. The fact 
that the zircons have mostly concordant U-Pb ages confirms their simple 
igneous history and, therefore, additional imaging to investigate potential 
zoning is not required here. 
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A synaptic threshold mechanism for computing 


escape decisions 
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Escaping from imminent danger is an instinctive behaviour that is 
fundamental for survival, and requires the classification of sensory 
stimuli as harmless or threatening. The absence of threat enables 
animals to forage for essential resources, but as the level of threat 
and potential for harm increases, they have to decide whether or 
not to seek safety!. Despite previous work on instinctive defensive 
behaviours in rodents”, little is known about how the brain 
computes the threat level for initiating escape. Here we show that 
the probability and vigour of escape in mice scale with the saliency 
of innate threats, and are well described by a model that computes 
the distance between the threat level and an escape threshold. 
Calcium imaging and optogenetics in the midbrain of freely 
behaving mice show that the activity of excitatory neurons in the 
deep layers of the medial superior colliculus (mSC) represents the 
saliency of the threat stimulus and is predictive of escape, whereas 
glutamatergic neurons of the dorsal periaqueductal grey (dPAG) 
encode exclusively the choice to escape and control escape vigour. 
We demonstrate a feed-forward monosynaptic excitatory connection 
from mSC to dPAG neurons, which is weak and unreliable—yet 
required for escape behaviour—and provides a synaptic threshold 
for dPAG activation and the initiation of escape. This threshold 
can be overcome by high mSC network activity because of short- 
term synaptic facilitation and recurrent excitation within the mSC, 
which amplifies and sustains synaptic drive to the dPAG. Therefore, 
dPAG glutamatergic neurons compute escape decisions and escape 
vigour using a synaptic mechanism to threshold threat information 
received from the mSC, and provide a biophysical model of how the 
brain performs a critical behavioural computation. 

Detecting and escaping from threats is an instinctive behaviour that 
reduces the chances of being harmed, but also results in the halting of 
other behaviours and the potential loss of resources. To balance escape 
with other survival behaviours, animals use sensory information and 
past experience to estimate threat and decide whether or not to escape. 
Although perceptual decision-making has been studied in primates and 
rodents using learned-choice tasks'*"?, and previous work has iden- 
tified key circuits for innate defence*®'*"*, the neurophysiological 
basis of escape decisions in mammals is largely unknown. Here we 
investigated escape in mice using innately aversive overhead expanding 
spots*'®, while varying the spot contrast to manipulate the saliency 
of the stimulus. Presentation of the stimulus while mice explored an 
arena with a shelter resulted in shelter-directed escape responses that 
were variable and probabilistic (Fig. la-c). Decreasing the stimulus 
contrast progressively increased reaction times and reduced escape 
probability, producing chronometric and psychometric curves similar 
to those from learned perceptual categorisation tasks!” (Fig. 1d, e, 
Supplementary Video 1). Response vigour (measured as the escape 
speed) also increased with contrast (Fig. 1f), showing that probability, 
reaction time and vigour of instinctive escape are innately matched 
to the saliency of the threat stimulus (see also Extended Data Fig. 1). 
The relationship between these variables was well described by a drift- 
diffusion model’””” that integrates a noisy threat level variable over 


time and implements the escape decision as a threshold-crossing pro- 
cess (Fig. 1g, see Methods). This was further supported by exposing 
mice to innately aversive ultrasonic sweeps, which generated escape 
with high probability, short reaction times and high vigour (Fig. 1b-f). 

Multiple brain regions contribute to instinctive defensive behav- 
iours>”*'418:19, so we next used optogenetic inactivation” of excita- 
tory neurons expressing vesicular glutamate transporter 2 (VGluT2*) 
to define critical circuit nodes for escape (Fig. 2a, b). Inactivation of 
the dPAG and mSC both severely affected escape—without affecting 
exploratory behaviour (Extended Data Fig. 2)—but in different ways. 
The inactivation of dPAG neurons switched the response to threat 
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Fig. 1 | Escape behaviour during threats of varying intensity. a, Video 
frames of escape to expanding spots. Yellow lines show the trajectory of the 
mouse during the preceding 2 s, stimulation onset is t= 0. b, Raster plot 

of mouse speed during escape trials for visual (top, organized by contrast) 
and sound (bottom) stimulation, sorted by reaction time (n = 13 mice). 

c, Single trial traces from one mouse escaping from different contrast 

spots (left) and sound (right). d-f, Chronometric (d) psychometric (e) and 
vigour (f) curves of contrast and escape behaviour; n = 13 mice, 209 trials; 
escape probability: P=2.5 x 107’, reaction time: P=3.5 x 10-8, vigour: 
P=1.6 x 10°. g, Theoretical model for computing escape from threat 
stimuli. Data points in d-f are means of trials pooled across mice, error 
bars are s.e.m., red lines are model fits to the data, P values are calculated 
using repeated-measures analysis of variance (ANOVA). 
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Fig. 2 | Encoding of threat and escape behaviour in the superior colliculus 
and periaqueductal gry. a, iChloC expression in VGluT2* dPAG neurons 
(top), speed raster during interleaved trials of threat presentation with 
light-off or on (middle), and summary for stimulation during dPAG 
inactivation (bottom) (Pescape = 0.03 + 0.03, Pfreeze = 0.86 + 0.06, mean 
freezing duration = 4.3 + 1.0 s;n=6 mice; escape: P=8.12 x 10>, freezing: 
P=0.00029; U-tests between light-off and light-on). b, Same as a for 
VGluT2* mSC inactivation (Pescape = 0.18 + 0.05, Pfreeze = 0.19 + 0.07; n=9 
mice; escape: P=5.15 x 1075, freezing: P= 0.02; U-tests as above). Reaction 
times are slower during mSC inactivation than during dPAG inactivation 
(P=0.002, two-tailed t-test). c, Field-of-view of dPAG VGluT2::GCaMPés 
neurons (top left), cell mask (bottom left) and single-trial examples (right). 
d, Average calcium response for active dPAG cells, aligned to escape and 
sorted by onset (57 out of 138 cells, n =3 mice, 55 trials). e, Left, distribution 
of dPAG cell onsets (curve is kernel density estimation, markers show 
onsets). Mean onset = —0.24+0.21 s (white marker, not different from 0 s; 
P=0.24, one-sample t-test). Right, example single-trial traces. 


from escape to freezing, with fast reaction times (269 + 35 ms, Fig. 2a; 
Supplementary Video 2), indicating that the threat was still detected 
and that the dPAG is specifically required to initiate escape. By con- 
trast, visual and sound stimulation after mSC inactivation produced 
no defensive response in 62 + 10% of light-on trials, which suggests 
that the link between the sensory stimulus and the response to threat 
was severely compromised (Fig. 2b, Supplementary Video 3). In the 
remaining trials, the reaction time was slow (1,443 + 255 ms, Fig. 2b) 
and the vigour of escape was reduced (Extended Data Fig. 2c), which 
is compatible with a reduction in the perceived level of threat. Similar 
results were obtained upon muscimol inactivation of the dPAG and 
mSC, whereas inactivation of the visual cortex (V1) or the amygdala 
caused only small decreases in escape probability and vigour (Extended 
Data Fig. 3). Next we performed calcium imaging of VGluT2* neu- 
rons in the deep layers of the mSC (dmSC) or in the dPAG in freely 


NATUR E|www.nature.com/nature 


f-h, Same as c-e for dmSC (177 out of 218 active cells, n = 8 mice, 111 
trials; mean onset = —1.51 £0.17 s, P=3.5 x 10”, Wilcoxon signed-rank 
test comparison with 0 s). i, Population activity for 98% contrast (z-score), 
grouped by trial outcome for dmSC (pink; 111 trials, P= 0.023, two-tailed 
t-test between escape and no escape; P=5.8 x 10~!°, one-sample t-test 
between no escape and 0 s) and dPAG (blue; 55 trials, P= 0.00028 and 
P=0.11, tests as for mSC). Dashed lines are activity without stimulus. 

j, ROC area under the curve (AUC, left), and AUC evolution for dmSC 
signals up to escape (right, 75 trials; error bars are s.d.). k, dmSC activity 
upon place entry increases after conditioning (left, dashed line is activity 
before conditioning; 57 trials, n =7 mice, P= 0.00013, two-tailed t-test 
between pre- and post-conditioning), whereas dPAG activity increases 
selectively upon escape (middle and right, z-score = 1.5 + 0.2, 20 trials, n =3 
mice, P= 0.0004, two-tailed t-test between pre- and post-conditioning). 
Box-and-whisker plots show median, interquartile range (IQR) and range. 
Error bars and shaded areas are s.e.m.; *** P< 0.001. 


behaving mice. Activity in both areas increased during stimulus-evoked 
escape (Fig. 2c, f), with a trial reliability of 28 + 3% for the dPAG and 
35 + 3% for the dmSC; this yielded a mean fraction of active cells 
of 14+5% and 23 + 6%, respectively, which was stable over multiple 
trials (Extended Data Fig. 4). However, the temporal profile of dPAG 
and dmSC activity was distinct. Whereas dPAG cells were active in the 
peri-escape initiation period (Fig. 2d, e), activity in most dmSC cells 
preceded escape onset (Fig. 2g, h), and this temporal difference was 
also reflected in the ensemble activity onset (onset relative to the start 
of escape: —0.25 £0.48 s for dPAG, —1.77£0.5 s for dmSC, P=0.59 
and P=0.00075 respectively, two-tailed t-test comparison with escape 
onset). Sorting trials from the same stimulus contrast by trial outcome 
(Fig. 2i) showed that dmSC neurons encode the threat stimulus, and 
also reflect the choice to escape (z-score = 1.93 + 0.23 for escape, 
1.18 £0.11 for no escape), whereas activity in dPAG neurons increases 
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Fig. 3 | Optogenetic stimulation shows different roles for mSC and 
dPAG in escape behaviour. a, Speed traces with increasing light intensity 
(10 Hz pulse, black lines) from one mouse (left, mSC; right, dPAG). 
b, Psychometric curve (mSC: 278 trials, n =4 mice, slope = 4.0, 95% 
confidence interval (2.75, 5.25)); dPAG: 590 trials, n =7 mice, slope = 26.3, 
95% confidence interval (22.1, 30.4)). Lines are logistic fits (pooled across 
all mice and binned light intensities), inset shows fit slope (error bars 
are s.d.). c, Chronometric curve (mSC: 149 trials, slope = —0.21, 95% 
confidence interval (—0.27, —0.15); dPAG: 328 trials, slope = —0.07, 95% 
confidence interval (—0.11, —0.03)). Lines are linear fits, inset as b. 
d, Correlation between light intensity and escape speed (mSC: 149 trials, 
P=0.04; dPAG: 328 trials, P= 1.5 x 107°; Pearson's r). Error bars are 
s.e.m. unless otherwise indicated, mSC data are shown in purple and 
dPAG in blue. 


exclusively in escape trials (z-score = 2.28 + 0.17 for escape, 0.49 £0.19 
for no escape). Receiver-operator characteristic (ROC) analysis of 
ensemble activity reflected this difference, and showed that an ideal 
observer of dmSC activity could predict the decision to escape 900 ms 
before escape initiation (68% correct; Fig. 2j). Ensemble dmSC activity 
also showed a strong negative correlation with reaction time, further 
suggesting that it is important for escape initiation (Extended Data 
Fig. 4i, j). To test further the nature of dmSC signals, we exposed mice 
to a place-aversion paradigm that resulted in spontaneous flight upon 
approaching the threat area (Extended Data Fig. 5, Supplementary 
Video 4). The activity of dmSC neurons after conditioning increased 
upon place entry and preceding escape, despite there being no stimulus 
presentation (z-score = 1.94 + 0.17; Fig. 2k). Importantly, pre-escape 
activity was still predictive of escape, and not related to head-rotation 
movements (Extended Data Fig. 4k), which indicates that dmSC neu- 
rons encode a variable that is correlated with the likelihood of escape. 
In agreement with the threat-stimulus data, dPAG neurons are active 
only during, and not before, escape initiation (Fig. 2k). In addition, 
there was a correlation between escape speed and peak calcium activity, 
which was approximately three times stronger in the dPAG than in 
the dmSC, and was specific for running during escape to the shelter 
(Extended Data Fig. 4], m). 

These activity profiles are consistent with dmSC neurons representing 
a pre-escape variable, such as threat intensity, whereas dPAG neurons 
encode the result of the threat-threshold computation. This predicts 
that direct activation of the dmSC should produce psychometric 
and chronometric curves that are similar to those produced by 
sensory stimulation, as activity is still being passed through the thresh- 
old mechanism to initiate escape, whereas dPAG stimulation should 
reliably elicit escape behaviour with short reaction times. We tested 
this prediction using in vivo channelrhodopsin-2 (ChR2) activation 
of dmSC or dPAG VGluT2? neurons (Fig. 3a), which recapitulated 
shelter-directed flights (Extended Data Fig. 6a—c, Supplementary 


LETTER 


Video 5). Gradually increasing the activation of the dmSC network by 
increasing light intensity progressively increased the escape probability 
and decreased the response variability (Fig. 3b, c), whereas increasing 
dPAG activity produced a steep, all-or-none curve, with stereotyped 
responses for each intensity (Fig. 3b, c), in agreement with our model 
hypothesis. Reaction times also decreased with stronger dmSC activa- 
tion, whereas escape latencies for dPAG activation were short across 
the stimulation range (Fig. 3d), demonstrating that dmSC activity 
determines the escape onset. Stimulation strength was also correlated 
with escape speed, but the correlation was stronger for dPAG than for 
dmSC stimulation (Fig. 3e), which suggests that dPAG activity repre- 
sents a post-threshold variable from which escape vigour is computed. 
Moreover, dmSC activation while inactivating the dPAG did not elicit 
escape, whereas inactivation of an alternative dmSC projection target— 
the parabigeminal nucleus (PBGN)°—did not impair escape, suggesting 
that threat information from the dmSC has to flow through the dPAG 
to initiate escape (Extended Data Fig. 6d-i). 

To determine whether mSC neurons project directly to dPAG 
neurons, we performed monosynaptic rabies tracing. This revealed 
a feed-forward connection with a 11:1 SC:dPAG convergence ratio, 
consisting of mostly medially located excitatory cells (Fig. 4a; Extended 
Data Fig. 7). Optogenetic activation of VGluT2* dmSC axons in vitro 
elicited excitatory monosynaptic input in 61% of VGluT2* dPAG 
neurons (Fig. 4b, left; Extended Data Fig. 8a—e). However, the con- 
nections were weak (peak excitatory postsynaptic current (EPSC): 
—37.9+11.9 pA), with high failure rates (20.3 + 8%) and low quantal 
content (2.3 + 0.6), and followed Poisson statistics, indicating a very 
low synaptic release probability (Fig. 4c, Extended Data Fig. 8f-h). 
Consequently, the probability of firing dPAG neurons was extremely 
low (0.02 + 0.01 for single light-pulses; Fig. 4d, e), providing a synaptic 
threshold for dmSC activity to engage the dPAG. However, repeated 
light stimulation elicited more action potentials than would be expected 
from temporal summation (spikes per pulse: 0.17 +0.1 for 10 Hz, 
0.16 + 0.08 for 20 Hz; membrane time constant = 28.3 +3 ms, signifi- 
cantly different from the 20-Hz inter-stimulus interval, P=5.8 x 10~%, 
one-sample t-test against 50 ms; Fig. 4e and Extended Data Fig. 8b). 
This happens because first, the connection facilitates (20 Hz paired- 
pulse ratio (PPR) = 1.22 + 0.09, 10 Hz PPR= 1.04 + 0.08), which 
provides input amplification at the synaptic level (Fig. 4f). Second, 
dmSC stimulation triggered a long-lasting increase in the frequency 
of spontaneous EPSCs (sEPSCs), which decayed to baseline with 
a time constant of 0.49 s (Fig. 4g). Recordings of VGluT2* dPAG- 
dPAG and dmSC-dmSC connectivity showed weak and sparse dPAG 
input onto dPAG cells (27%, —54+ 8.3 pA), whereas 100% of dmSC 
cells received strong input from other dmSC cells (—146.7 + 41.5 pA, 
Fig. 4h), which is in agreement with previous work”! and suggests 
that recurrent excitation in the dmSC amplifies signals at the network 
level. Together, these synaptic and network mechanisms allow sus- 
tained dmSC activation to overcome the weak connection to VGluT2* 
dPAG neurons and drive firing of the escape network. In vivo silicon 
probe recordings in awake, head-fixed mice showed that during threat 
stimuli???3, dmSC single units fire in the short-term facilitation 
frequency range of the dmSC-dPAG synaptic connection (73 units 
from 3 mice, Extended Data Fig. 9), in a contrast-dependent manner 
(peak firing rate: 20.4+4.1 Hz for 98%, 10.7 41.8 Hz for 50%, 
23.9 +2.5 Hz for sound, Fig. 4i). Moreover, a fraction of units sus- 
tained increased firing beyond the stimulus (37% of visual- and 15% 
of sound-responding units; time constant to decrease to baseline: 0.23 
sand 5.8 s, respectively; Fig. 4j), in agreement with recurrent dmSC 
activity assisting with the integration to threshold. In the final set of 
experiments, we tested whether the dmSC-dPAG connection is crit- 
ical for computing escape. We co-expressed the synaptically-targeted 
inhibitory designer receptor hM4D-neurexin (hM4D"™")** and ChR2 
in VGluT2* dmSC neurons, which caused a 71 +7% reduction in syn- 
aptic transmission to the dPAG in the presence of clozapine-N-oxide 
(CNO), while leaving dmSC neuron firing intact (Fig. 4k, Extended 
Data Fig. 10a, b). In vivo microinfusion of CNO over dmSC-dPAG 
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Fig. 4 | Neural circuit and biophysical mechanisms for computing 
escape behaviour. a, Left, PAG VGluT2* (blue) and presynaptic cells 
(pink) from rabies tracing, for deep SC (dSC), intermediate SC (iSC) and 
superficial SC (sSC). Right, SC:dPAG convergence ratios for single dPAG 
cells. b, mSC-dPAG connectivity (left; n = 79 cells, n =21 mice), example 
traces (middle), and failures summary (right, n = 8 cells, n =7 mice). 

c, Direct quantal content versus estimation from failure rate (fit slope = 
0.92, 95% confidence interval (0.74, 1.1); n= 15 cells). Blue line, linear 
fit; dashes, unity line (see Methods). d, dPAG voltage response to mSC 
stimulation. e, Spiking probability summary (n = 20 cells, n= 10 mice; 
plot shows mean and s.e.m.). f, Example average trace (left) and summary 
(right; n = 11, 18 cells; n=7, 8 mice for 10 Hz, 20 Hz respectively). 

g, dPAG example trace during mSC stimulation (middle) and sEPSC 
raster (5 trials, top). Bottom, sEPSC frequency summary (n = 21 cells, 
n=9 mice). Dashed line, exponential fit. h, Examples and connectivity 
for dPAG (top; n= 11 cells, n =2 mice) and dmSC (bottom; n= 22 cells, 
n=10 mice). i, Top, firing-rate histograms and spike rasters from 

one dmSC single unit. Bottom, summary data (32 visual- and 

45 sound-responsive units, P=0.01 for 50% versus 98%, P=2.8 x 10-> 


synapses blocked escape in response to visual stimuli (Extended Data 
Fig. 10c) and optogenetic dmSC activation, similar to systemic CNO 
injection (Fig. 4k, 1, Supplementary Video 6). Notably, doubling the 
intensity or the frequency of optogenetic stimulation was not sufficient 
to rescue escape (Extended Data Fig. 10a, d), whereas inhibiting the 
dmSC projection to the lateral posterior nucleus of the thalamus (LP) 
did not affect escape (Fig. 41). 

Our results support a model in which threat evidence is integrated 
in the dmSC and passed through a synaptic threshold at the dPAG 
level to initiate escape (Fig. 4m). Although it is likely that several mSC 
projections support escape behaviour, we show that the dmSC-dPAG 
synaptic connection is required for the initiation of escape, whereas 
SC projections to LP*” are not, which suggests that there might be 
dedicated projections for controlling freezing’ and escape. Also, in 
contrast to previous work° using optogenetic activation of SC pro- 
jections to the PBGN, we did not find a critical role for this pathway 
in escape initiation, which could be explained in previous studies by 
antidromic activation of SC neurons projecting to both PBGN and 
dPAG, or by back-projections to the SC. A key result is that dmSC 
activity encodes a high-order signal predictive of escape, in agreement 
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for 50% visual versus sound). j, Top, example unit showing persistent 
activity, and average histogram for cells with persistent activity (bottom, 
dashed line shows exponential fit; 7 units). k, hM4D"™ activation does 
not affect ChR2-evoked mSC cell firing (top), but blocks mSC-dPAG 
EPSCs (middle). Bottom, example speed rasters during mSC activation 
before (left) and after (right) CNO microinfusion to the mSC-dPAG 
projection. 1, Summary of CNO application to mSC-PAG (local CNO 
injection compared to baseline: Pescape = 0.08 + 0.05 versus Pescape-base = 
0.87 + 0.03, n=5 mice, P= 0.0008; i.p. CNO compared to baseline: 
Pescape = 0.05 + 0.05 versus Pescape-base = 0.97 + 0.03, n=4 mice, 

P=0.01; P=0.5 for local versus ip. CNO), and mSC-LP projection 
(local CNO compared to baseline: Pescape = 1.0 + 0 versus Pescape-base = 
0.95 + 0.03, n=4 mice, P= 0.1; ip. CNO versus baseline: 

Pescape = 0.04 + 0.02 versus Pescape-base = 0.89 + 0.06, n=3 mice, P= 0.04; 
P=0.01 for local versus i-p. CNO). Saline mSC-dPAG microinfusion and 
CNO ip. injection without hM4D"™" do not reduce escape (n = 5 mice, 
P>0.15). m, Escape decision model. Shaded areas show s.e.m; box-and- 
whisker plots show median, IQR and range. P values: two-tailed U-test. 


with its role in multisensory integration” and decision making’>. 


Successfully escaping from threats to reach safety requires the inte- 
gration of multiple information streams, including knowledge about 
the spatial environment”, and our results provide a mechanistic entry 
point for understanding how the brain computes a fundamental sur- 
vival behaviour, and goal-directed behaviours in general. 
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Any Methods, including any statements of data availability and Nature Research report- 
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METHODS 
Mice. Male and female adult C57BL/6J wild-type, VGluT2-ires-Cre”? (Jackson 
Laboratory, stock 016963) and VGluT2::eY FP (R26 eYFP, Jackson Laboratory 
006148; eYFP, enhanced yellow fluorescent protein) mice were housed with 
free access to chow and water on a 12:12 h light:dark cycle and tested during the 
light phase. All experiments were performed under the UK Animals (Scientific 
Procedures) Act of 1986 (PPL 70/7652) following local ethical approval. Minimum 
sample sizes were predetermined from power estimates based on pilot experi- 
ments. Animals in test and control groups were littermates and randomly selected. 
Behavioural experiments were not performed blinded as the experimental setup is 
closed-loop and automatically delivers stimuli. Behavioural data were annotated 
blinded and by several experimenters. 

Surgical procedures. Mice were anaesthetized with an intraperitoneal (i-p.) 
injection of ketamine (95 mg kg!) and xylazine (15.2 mg kg~!), and carpofen 
(5 mg kg~’) was administered subcutaneously. Isoflurane (0.5-2.5% in oxygen, 
11 min~') was used to maintain anaesthesia. Craniotomies were made using a 
0.5-mm burr and viral vectors were delivered using pulled glass pipettes (10 jl 
Wiretrol II with a Sutter P-1000) in an injection system coupled to a hydraulic 
micromanipulator (MO-10, Narishige) on a stereotaxic frame (Model 1900 and 
963, Kopf Instruments), at approximately 10 nl min” !. Implants were affixed using 
light-cured dental cement (RelyX Unicem 2, 3M) and the wound sutured (6-0, 
Vicryl Rapide) or glued (Vetbond). Coordinates are measured from lambda. 
Viruses. The following viruses were used in this study and are referred to by 
contractions in the text. For optogenetic activation, adeno-associated virus 
(AAV) AAV2-EFla-DIO-hChR2(H134R)-eYFP-WPRE (3.9 x 10’* genome cop- 
ies per ml (GC ml~!)) and AAV2-EFla-DIO-hChR2(H134R)-mCherry-WPRE 
(6.6 x 102 GC ml}; Deisseroth) were acquired from the UNC Vector Core. 
Optogenetic inhibition experiments were performed with AAV9-Efla-DIO-iChlo- 
2A-tDimer (3.75 x 10’? GC ml 15a gift from S. Wiegert and T. Oertner) or AAV1- 
EF1la-DIO-iChloC-2A-dsRed (5 x 10° GC ml"!; Addgene 70762, a gift from 
T. Margrie). For control and calcium-imaging experiments respectively, AAV2-EFla- 
DIO-eYFP-WPRE (4.0 x 10!2 GC ml!) and AAV9-CAG-DIO-GCaMP6s-WPRE 
(6.25 x 102 GC ml!) were acquired from Penn Vector Core. For retrograde rabies 
tracing, EnvA pseudotyped SADB19 rabies virus (EnvA-dG-RV-mCherry) was 
used in combination with AAV8 coding for the EnvA receptor TVA and rabies 
virus glycoprotein (RG) that were prepared from pAAV-EF la~-FLEX-GT (Addgene 
plasmid 26198, Callaway) and pAAV-Syn-Flex-RG-Cerulean (Addgene plasmid 
98221, Margrie). All viruses used for rabies tracing were a gift from T. Margie*, and 
had been previously tested for leakiness and specificity*!. Additionally, a recom- 
binant AAV with retrograde functionality (rAAV2-retro-mCherry, 6.97 x 10!” 
GC ml" !, Addgene 81070°”) was used. For chemogenetic inhibition experiments, 
AAV5-CAG-DIO-mCherry-2A-hM4D-HA-24-nrxn1A (3.9 x 10’? GC ml“|, a gift 
from S. Sternson) or AAV2-CAG-DIO-mCherry-2A-hM4D-nrxn1A (6.19 x 10! 
GC ml}, Addgene 60544) were used. 

Behavioural procedures. Experimental set-up. All behavioural experiments were 
performed in a rectangular Perspex arena (W: 20 cm x L: 60 cm x H: 40 cm) 
with a red-tinted shelter (19 cm x 10cm x 13.5 cm) at one end, housed within 
a sound-deadening, light-proofed cabinet with six infrared light-emitting diode 
(LED) illuminators (TV6700, Abus). A screen (90 cm x 70 cm; ‘100 micron drafting 
film, Elmstock) was suspended 64 cm above the arena floor, and a DLP projector 
(IN3126, InFocus) back-projected a grey uniform background via a mirror, pro- 
viding 7-8 lx at the arena floor. Experiments were recorded at 50 frames per second 
with a near-IR GigE camera (acA1300-60gmNIR, Basler) positioned above the arena 
centre. Video recording, sensory and optogenetic stimulation was controlled with 
custom software written in LabVIEW (2015 64-bit, National Instruments) and 
Mantis software (mantis64.com). The position of the mouse was tracked online, 
and used to deliver stimuli when the mouse entered a predefined ‘threat area (21 cm 
x 20 cm area at opposite end to shelter). An empty plastic Petri dish (replaced fresh 
for each mouse; 35 mm) was affixed to the arena floor in the centre of the threat area 
to enrich the environment. All signals and stimuli, including each camera frame, 
were triggered and synchronised using hardware-time signals controlled with a 
PCle-6351 board (National Instruments). 

Protocols. Mice were placed in the arena and given 8 min to explore the new envi- 
ronment, after which sensory stimuli were delivered when the mouse entered the 
threat area for longer than 100 ms. A typical experiment lasted 30-60 min. In the 
standard visual stimulation protocol, we used a pseudo-random contrast sequence 
to minimise the development of aversion or habituation during the behavioural 
session (see Extended Data Figs. 1 and 5e, f for quantification). The sequence con- 
sisted of a first stimulus at 98% contrast, followed by a random selection without 
replacement from the remaining contrasts, and this process was repeated until the 
end of the behavioural session. Each stimulus was delivered with an inter-stimulus 
interval of at least 30 s. For the conditioning protocol shown in Fig. 2k and Extended 
Data Fig. 5, repeated presentations (3-6 trials) at 98% contrast were delivered with 
no minimum inter-stimulus interval after a 10-min acclimatization period. 


Sensory stimuli. The standard visual stimulus was a sequence of five dark expanding 
circles, and unless otherwise stated, each subtended a visual angle of 2.6° at onset 
and expanded linearly at 118° s~' to 47° over 380 ms, after which it maintained the 
same size for 250 ms and began an inter-stimulus interval of 500 ms. The contrast 
of the spot was varied in a number of experiments, and for clarity is reported as a 
positive percentage (low to high; for example, 25% to 98%), converted from the 
negative Weber fraction (low to high; —0.25 to —0.98). The contrast was varied 
by altering the intensity of the spot against a grey screen maintained at constant 
luminance (standard luminance, 7.95 cd m~). The spot was located on the screen 
directly above the centre of the threat area, approximately 15° from the zenith of 
the mouse. The auditory stimulus consisted of a frequency-modulated upsweep 
from 17 to 20 kHz over 3 s (ref. **). Waveform files were created in MATLAB 
(Mathworks), and the sound was generated in Lab VIEW, amplified and delivered 
via an ultrasound speaker (L60, Pettersson) positioned 50-55 cm above the arena, 
centred over the threat area. 

Analysis. Behavioural video and tracking data was sorted into peri-stimulus trials 
and manually annotated. Detection of the threat stimulus was assumed if the mouse 
showed a stimulus-detection response, in which the ears of the mouse move poste- 
riorly and ventrally, which precedes interruption or commencement of body move- 
ment. To differentiate failures of escaping from failures of attending to the stimulus, 
trials with no stimulus-detection response were excluded from the analysis. This 
resulted in the exclusion of three no-escape trials from the 25% contrast dataset, 
which increased the escape probability from 0.12 to 0.13. The onset of escape was 
measured as the first video frame marking the onset of a continuous movement 
consisting of a head turn followed by a full-body turn towards the shelter. Escape 
was annotated automatically and defined as the mouse moving to enter the shelter in 
a single movement without stopping, within 0.9 s after stimulus termination (or 6s 
after approaching a 15-cm boundary from the threat area for spontaneous escapes 
after conditioning). Behaviour metrics were calculated by pooling all trials and mice 
(Fig. 1d-f) and also by analysing each mouse individually and then computing an 
average value across all mice (Extended Data Fig. 1a—c). Statistical analysis was 
performed using the number of mice as the sample size. The escape probability for 
a given stimulus is the fraction of trials which led to an escape to the shelter. The 
maximum speed of the escape is calculated as the peak value of the speed trace 
between the onset of the escape and entry to the nest. Quantification of exploratory 
behaviour was done for behavioural sessions lasting at least 40 min, by calculating 
the cumulative displacement of the mouse in 1-min bins followed by smoothing 
with a five-point flat window. We did not observe any differences in the behav- 
ioural response to threat stimulation between male and female mice, and therefore 
data from both sexes has been pooled (for 98% contrast stimulation, escape prob- 
ability: 0.86 for males, 0.88 for females, P= 1.0, Fisher exact test; reaction time: 
369.2 +51.8 ms for males, 365.6 + 39.6 ms for females, P= 0.96, two-tailed t-test; 
vigour: 91.8+4.5 cm s-! for male, 89.1 + 11.1 for female, P=0.81, two-tailed t-test). 
Behavioural model. The threat level (T) evolves over time according to 


dT 
Te =-T+Ca(t)+o,w 


where a(t) is the diameter of the expanding visual spot scaled by the spot contrast 
C. The variable ry sets the time constant for changing the threat level and W is a 
white-noise Wiener process parametrised by oy. At each time point, T is compared 
against a threshold B, and escape initiated if T > B. The reaction time is the time 
at threshold crossing measured relative to stimulus onset. In this model we allow 
the threat level to continue evolving after the threshold has been crossed, similar 
to previous work on changes of mind during decision making™, and escape vigour 
Vis computed from the peak of the threat level as a logistic function: 


1 


(—k(T-Bs)) 


V= 
l+e 


The model was first fitted with three free parameters (B, T7, on) to the reaction 
time and escape probability data simultaneously by simulating 10,000 trials for 
each parameter set and using the brute force method in LMFIT Python 2.7 pack- 
age. Escape vigour was then fitted to the average peak threat levels across all trials 
with free parameters k and s using least-squares minimisation in LMFIT. The fit 
parameters for the curves shown in Fig. 1 are: B= 0.165, Tr = 1,200 ms, on=0.6, 
k=90,s=1.5. 

Pharmacological inactivation. Mice were bilaterally implanted with guide cannu- 
lae (Plastics One, Bilaney Consultants) over the target region (see Supplementary 
Table 1) and given at least 48 h for recovery. On the test day, mice were placed in the 
standard arena for 10 min and escape responses were assessed with a single visual 
stimulus (one 98% contrast expanding spot) or auditory stimulus. Additionally, 
in PBG- and PAG-cannulated mSC-VGluT2::ChR2 mice, optogenetic responses 
were also evoked. The mice were then lightly anaesthetized in an induction cham- 
ber and placed on a heating pad where anaesthesia was maintained with a nose 
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cone (2% isoflurane, 1 1 min~'). Internal cannulae were inserted and sealed with 
Kwik-Sil. Muscimol-BODIPY-TMR-X (0.5 mg ml!) or Alexa-555 (100 ,.M; Life 
Technologies), dissolved in 1:1 phosphate-buffered saline (PBS): 0.9% saline with 
1% dimethyl sulfoxide (DMSO), was then infused at a rate of 70-100 nl min~! 
using a microinjection unit (10 jl Model 1701 syringe; Hamilton, in unit Model 
5000; Kopf Instruments) followed by a 5-min wait period per hemisphere. Mice 
spent no longer than 30 min under anaesthesia and were given 30 min to recover 
in the home cage, after which they were placed back in the cleaned arena and 
subjected to visual, auditory or optogenetic stimulation. Immediately upon 
termination of the behavioural assay, around 1 h after infusion, mice were anaes- 
thetized with isoflurane (5%, 21 min~') and decapitated. Acute slices (150 jum) were 
cut using a microtome (Campden 7000smz-2 or Leica VT 1200S) in ice-cold PBS 
(0.1 M), directly transferred to 4% paraformaldehyde (PFA) solution, and kept 
for 20 min at 4°C. The slices were then rinsed in PBS, counter-stained with 4’,6- 
diamidino-2-phenylindole (DAPI; 3 |.M in PBS), and mounted on slides in 
SlowFade Gold (Life Technologies) before wide-field imaging (Nikon TE2000) 
on the same day to confirm the site of infusion. Behavioural data was annotated as 
described. For the calculation of the maximum exploration speed, the peak speed 
of the 7-min acclimatisation period before stimulation was used. Statistical analysis 
was performed using the number of mice as the sample size. 

Calcium imaging in freely-moving mice. Data acquisition. A miniaturised 
head-mounted fluorescence microscope (Model L, Doric Lenses Inc.) was used 
to image GCaMP6s in neurons of male VGluT2-Cre mice. AAV9-CAG-Flex- 
GCaMP6s (300-550 nl; Penn Vector Core) was injected into the mSC (anteroposte- 
rior, AP: —0.2 to —0.5; mediolateral, ML: +-0.25; dorsoventral, DV: —1.6) or dPAG 
(AP: —0.4 to —0.6, ML: +0.25, DV: —2.2). At the level of the inferior colliculus, 
the dura was incised using a 30G needle, and gently pulled forward to partially 
reveal the SC. A GRIN lens-equipped cannula (SICL_V_500_80; Doric Lenses 
Inc.) was used to push forward the transverse sinus and inserted to the same depth 
as the injection coordinates, after which the craniotomy was covered with Kwik- 
Cast and the cannula affixed with dental cement. At least 21 days after surgery, 
the microscope was attached to the mouse without anaesthesia, and the mouse 
was placed back in the home cage for 5-10 min, for acclimatisation to the micro- 
scope. During this period, the optimal imaging parameters for the preparation 
were determined: the acquisition rate was 14.2 Hz in most experiments (median; 
range: 10-20 Hz) with an excitation power of 450 \.W (median; range: 0.2-1.1 
mW). After a baseline period of 7 min, mice were exposed to visual and/or auditory 
stimulation. For the visual stimulation, the contrast was 98%, the inter-stimulus 
interval was 750 ms, and the post expansion period was 20 ms, with the total 
epoch length and expansion rate unchanged. A typical session lasted 1.5 h (1-3 
sessions per mouse), with imaging data acquired during stimulation and control 
trials in approximately 5-min epochs, with at least 2 days between sessions. If 
the mouse showed prolonged bouts of inactivity, imaging was halted to minimize 
photobleaching. Fluorescence and behavioural frame trigger signals were acquired 
at 10 kHz for offline synchronisation. 

Data analysis. Behavioural video and tracking data were sorted into peri-stimulus 
trials and manually annotated to mark behavioural events as described above. 
Fluorescence stacks were registered** and background-subtracted (Fiji). Cell body- 
like structures were identified manually as regions-of-interest (ROIs; elliptic or 
polygonal areas) in Fiji using the maximum intensity projection of registered mov- 
ies, aided by inspection of deconvolved images. For each mouse, ROI masks were 
rigidly translated to account for field-of-view (FOV) movement between imaging 
sessions, and new cells added to the FOV if they became visible. In some cases, 
the FOV moved such that ROIs could not be mapped to the previous sessions, and 
it was therefore counted as a new FOV. Mean intensity traces were extracted for 
each ROL, interpolated with the behavioural video frames and tracking data, and 
the change in fluorescence intensity relative to the resting fluorescence intensity 
(AF/F) calculated on a trial-by-trial basis with a baseline of 5 s before stimulus 
onset. Traces were then smoothed with a 20-point Hanning window and z-scored. 
ROIs were only included in the analysis if they had transients with a z-score above 
2 at any time during the recording session, to ensure that they were live, active 
neurons. Average responses for each cell were obtained by averaging across all 
trials independent of the trial outcome and statistical analysis was performed on 
all cells pooled together. Ensemble average responses were obtained by averaging 
the responses of all cells ina FOV and summary statistics calculated over all trials 
for each FOV. For the ROC analysis, the annotated behavioural outcomes were 
used to sort data into ‘Escape’ and ‘No Escape’ classes, and the ROC curves and 
AUC statistics were calculated using the open-source package Scikit-learn. The 
s.d. for the AUC was estimated using bootstrapping. ‘Peri’ and ‘Pre-escape’ time 
periods were defined as escape onset + 1 s and <1 s, respectively. For the plot 
in Extended Data Fig. 4i, escape latencies were first binned and average calcium 
signal waveforms calculated for each bin, and the signal rise slope was obtained by 
fitting a linear function (y= mx + b). The onset of calcium signals was measured 
by finding the time of the peak and iteratively moving backwards along the signal 
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to determine the time point at which the signal reaches the baseline. Peak calcium 
responses after conditioning were taken from a 5-s time window starting when 
the mouse entered the threat area. 

Optogenetic experiments. For optogenetic activation*”, VGluT2-Cre and 
VGluT2::eYFP mice were injected with AAV-DIO-ChR2-eYFP or -mCherry, (see 
Viruses) into the dmSC (80-120 nl per side, ML: £0.2 to 0.35, AP: —0.25 to —0.45, 
DV: —1.4 to —1.55) or dPAG (40-80 nl per side ML: +0.0 to —0.4, AP: —0.4 to 
—0.6, DV: —1.95 to —2.2). Control mice were injected with 120 nl AAV2-DIO- 
eYFP into the dPAG. One optic fibre (200-j1m diameter, MFC-SMR; Doric Lenses 
Inc.) was implanted per mouse, medially, 250-300 1m dorsal to the injection site. 
For optical stimulation, light was delivered by a 473-nm solid-state laser (CNI) in 
conjunction with a continuous neutral density filter wheel for varying light inten- 
sity (NDC-50C-4M, Thorlabs) and a shutter (LS6, Uniblitz) driven by trains of 
pulses generated in LabVIEW. In some experiments, this system was substituted by 
a laser diode module (Stradus, Vortran) with direct analogue modulation of laser 
intensity. Magnetic patchcords (Doric Lenses Inc.) were combined with a rotary 
joint (FRJ 1x1, Doric Lenses Inc.) to allow the cannula to be connected without 
restraint and allow unhindered movement. In all experiments, mice were placed in 
the standard arena and given 8 min to acclimatise. As the fraction of cells spiking 
in a ChR2-expressing neuronal network increases as a function of light intensity 
in vivo*’, we chose to systematically modulate light intensity as a proxy for setting 
the level of activation in the dPAG and mSC. For the intensity modulation assay, 
the laser intensity was set initially to give a low irradiance (0.1-0.2 mW mm ~”) 
that did not evoke an observable behavioural response. Mice were photostimulated 
(473 nm, train of 10 light pulses of 10 ms at 10 Hz) upon entering the threat area 
with an inter-stimulus interval of at least 30 s. After at least three trials of this 
intensity, the irradiance was increased by 0.1-0.3 mW mm *? until a behavioural 
response was observed, after which 8-15 trials were obtained at a given intensity, 
before further increasing the light intensity. This process was iterated until an 
intensity was reached which always evoked a flight response (Pescape = 1). For one 
mouse, the standard stimulus was not sufficient to reach Pescape = 1 and the curve 
was acquired with a higher frequency stimulus (10 light pulses of 10 ms at 20 Hz). If 
the mouse stopped exploring the arena, precluding P.scape= 1 from being obtained, 
the experiment was terminated after 4 h and not analysed. To normalize stimula- 
tion intensity and compare across mice, trials were first classified as escape if the 
mouse reached the shelter within 5 s of stimulation onset, to calculate the fraction 
of escape trials at a given intensity. The escape probability curve of each mouse 
was then fitted with a logistic function (1/(1 + eK) and light intensities were 
normalized to x0. In the frequency modulation assay, high laser power was used 
(range, 12-13.5 mW mm”) and the stimulus consisted of 10 light pulses of 10 ms 
at either 2, 5, 10, 20 and 40 Hz, delivered in a pseudo-random order. 

For histological confirmation of the injection site, mice were anaesthetized with 
Euthatal (0.15-0.2 ml) and transcardially perfused with 10 ml of ice-cold PBS with 
heparin (0.02 mg ml") followed by 4% paraformaldehyde (PFA) in PBS solution. 
Brains were post-fixed overnight at 4°C then transferred to 30% sucrose solution 
for 48 h. Sections (30 1m) were cut with a cryostat (Leica CM3050S) and a standard 
free-floating immunohistochemical protocol was used to increase the signal of the 
tagged ChR2 and counter-stain neurons. The primary antibodies used were anti- 
GFP (1:1,000, chicken; A10262, or rabbit; A11122, Life Technologies), anti-RFP 
(1:1,000, rabbit; 600-401-379, Rockland) and anti-NeuN (1:1,000, mouse; MAB- 
377, Millipore) and the secondary antibodies were Alexa-488 Donkey anti-rabbit 
and Goat anti-chicken, Alexa-568 Donkey anti-rabbit and Donkey anti-mouse, and 
Alexa-647 Donkey anti-mouse (1:1,000, Life Technologies). Brain sections were 
mounted on charged slides using the mounting medium SlowFade Gold (contain- 
ing DAPI; $36938, Life Technologies), and imaged using a wide-field microscope 
(Nikon TE2000). 

For optogenetic inactivation experiments, VGluT2-Cre and VGluT2::eYFP 
mice were injected with AAV-DIO-iChloC-dsRed, (see “Viruses’) into the dmSC 
(250 nl per side, ML: £0.35, AP: 0.1 to —0.45, DV: —1.4 to —1.55) or dPAG 
(200 nl per side, ML: 0.4, AP: —0.4 to —1, DV: —2.2), with two injections 
per hemisphere along the AP axis spaced 300 jm apart. Dual optic fibres 
(400 xm diameter, 1.2 mm apart, DFC_400/430-0.48_3.5mm_GS1.2_C60; Doric 
Lenses Inc.) were implanted at the injection site. Behavioural testing was done 
10-41 days after virus injection. Mice were presented with visual or auditory 
stimuli that elicited escape, and laser-on trials were interleaved with laser-off 
trials (473 nm, 5-8 s square pulse, 15 mW mm”). For histological confirmation 
of the fibre placement and injection site, mice were decapitated under anaesthesia, 
brains were quickly removed and post-fixed in 4% PFA overnight at 4°C. Slices 
of 100 jum thickness were cut on a HM650V vibratome (Microm) in 0.1 M PBS, 
stained with DAPI before mounting, and imaged on a wide-field microscope 
(Axio Imager 2, Zeiss). 

Chemogenetic inactivation experiments. VGluT2-Cre and VGluT2::eYFP mice 
were injected with AAV-DIO-hM4D-nrxn-mCherry (see ‘Viruses’) into the dmSC 
(200-250 nl per side, ML: £0.35, AP: —0.1 to —0.45, DV: —1.4 to —1.55), with 
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2-3 injections per hemisphere along the AP axis. Dual guide cannulae were 
implanted at ML: 40.6, AP: —0.55, DV: —1.6 to target the SC-dPAG projection, 
and ML: +1.7, AP: +1.7, DV: —2.05 (angle: 7° lateral from zenith) to target the 
SC-LP thalamus projection. In experiments with optogenetic stimulation, AAV- 
DIO-ChR2-eYFP was injected into the dmSC first (coordinates and volumes as 
above) and a 200-1m optic fibre cannula was implanted at ML: +0.1, AP: —0.3, DV: 
1.35 (angle: 35° posterior from zenith). After 20-55 days, escape responses to opto- 
genetic or visual stimuli were assessed in a baseline session to estimate the stimulus 
intensities that evoke escape with Pescape= 1. Thirty minutes after microinfusion or 
ip. injection, escape responses were reassessed using the same stimuli, and, for opto- 
genetic activation, 200% of baseline intensity or frequency were tested in addition 
to the baseline strength. For cerebral microinfusions, CNO was diluted in buffered 
saline containing (in mM): 150 NaCl, 10 p-glucose, 10 HEPES, 2.5 KCl, 1 MgCh, 
and to a final concentration of 1 or 10 .M. Experiments with visual-evoked escape 
were done with 1 1M, and optogenetically-evoked escape with 1 and 10 1M. There 
was no significant difference between 1 and 10 1M at the electrophysiological and 
behavioural level, and the data have therefore been pooled (comparisons between 
1M and 10 pM CNO: ChR2-induced firing of SC VGluT2* neurons, P> 0.999 
Wilcoxon test; SC-dPAG VGluT2* EPSC amplitude, P= 0.0973 Mann-Whitney 
test; Pescape after CNO microinfusion, P= 0.6095, Mann-Whitney test). Cerebral 
microinfusions of CNO or vehicle were performed as described above using 500 
jum protruding internal cannulae (see Pharmacological Inactivation), with a volume 
of 0.6-1.0 jl per hemisphere. For i.p. injections, 1 mg CNO was dissolved in 1 ml 
0.9% saline just before the experiment and injected at a final concentration of 10 
mg kg !. Repeated administration of CNO was separated by 2-3 days, preceded 
by a new baseline session for each treatment. Histological confirmation of cannula 
placements and viral infection was performed as stated above. 
Electrophysiological recordings in acute midbrain slices. Data acquisition. 
Coronal slices were prepared from VGluT2::eYFP mice aged 6-12 weeks. Brains 
were quickly removed and transferred to ice-cold slicing solution containing 
(in mM): 87 NaCl, 26 NaHCOs, 50 sucrose, 10 glucose, 2.5 KCl, 1.25 NaH2POu, 
3 MgCh, 0.5 CaCl). Acute coronal slices of 250 jum thickness were prepared at 
the level of the SC and PAG (—4.8 to —4.1mm from bregma) using a vibratome 
(VT1200, Leica or 7000smz-2, Campden). Slices were then stored under sub- 
merged conditions, at near-physiological temperature (35°C) for 30 min before 
being cooled down to room temperature (19-23 °C). For recordings, slices were 
transferred to a submerged chamber and perfused with artificial cerebrospi- 
nal fluid (aCSF) containing (in mM): 119 NaCl, 26 NaHCOs, 10 glucose, 2.5 
KCl, 2 CaCh, 1 MgCl, 1 NaH2PO, (heated to 34°C at a rate of 2-3 ml min~!), 
All aCSF was equilibrated with carbogen (95% O2, 5% COz, final pH 7.3). 
Whole-cell patch-clamp recordings were performed with an EPC 800 ampli- 
fier (HEKA). Data was digitised at 20 kHz (PCI 6035E, National Instruments), 
filtered at 5 kHz and recorded in LabVIEW using custom software and Mantis 
software (mantis64.com). Pipettes were pulled from borosilicate glass cap- 
illaries (Harvard Apparatus, 1.5-mm OD, 0.85-mm ID) with a micropipette 
puller (P-1000, Sutter, USA or P-10, Narishige, Japan) to a final resistance of 
4-6 MO. Pipettes were backfilled with internal solution containing (in mM): 
130 potassium gluconate or KMeSOs3, 10 KCI, 10 HEPES, 5 phosphocreatine, 
2 Mg-ATP, 2 Na-ATP, 1 EGTA, 0.5 Naj-GTP, 285-290 mOsm, pH was adjusted 
to 7.3 with KOH. VGluT2* dPAG and dmSC cells were visualized on an upright 
Slicescope (Scientifica) using a 60 objective (Olympus) and identified based on 
location and eYFP expression. The resting membrane potential was determined 
immediately after establishing the whole-cell configuration and experiments were 
continued only if cells had a resting membrane potential more hyperpolarized than 
—45 mV. Input resistance (Ri) and series resistance (Rs) were monitored contin- 
uously throughout the experiment, and R, was compensated in current-clamp 
recordings. Only cells with a stable R, < 30 MO were analysed. For ChR2-assisted 
circuit mapping, recordings were made 10-51 days (mean = 22.3 + 2.3 days) after 
injection of AAV2-DIO-ChR2-mCherry into the mSC or dPAG of VGluT2::eYFP 
mice. ChR2 was stimulated with wide-field 490-nm LED illumination (pe-100, 
CoolLED, 1-ms or 10-ms pulse length, maximum light intensity =2.7 mW). To 
characterize the cellular effects of iChloC activation, PAG or dmSC VGluT2* cells 
expressing AAV5-DIO-iChloC-dsRed were recorded from at 46 days after infec- 
tion, and iChloC was stimulated with 1-s long, 490-nm light pulses. Recordings in 
mice expressing hM4D-nrxn in the dmSC were made 22-53 days after injection 
(mean = 29.4 + 3.1 days), and ChR2 was activated at 10, 20 and 100% light intensity 
(0.27, 0.54 and 2.7 mW). 

Pharmacology. No drugs were added to the recording aCSF, except for the 
following experiments: miniature EPSCs (mEPSCs) were recorded in 1 1M 
tetrodotoxin (TTX, Sigma Aldrich), and ESPC recordings shown in Extended 
Data Fig. 8 were recorded in 1 tM TTX and 100 .M 4-aminopyridine (4-AP, 
Sigma Aldrich); to test the effect of hM4D-neurexin activation on firing rates 
and synaptic transmission, 1-10 1M CNO (free base, Hellobio) was added to the 
aCSF during recordings. 


Data analysis. Analysis was performed using custom-written procedures in Python, 
except for the analysis of sEPSCs and mEPSCs which was done in IGOR Pro 6 
(WaveMetrics) using TaroTools (by Taro Ishikawa). The Rin was calculated from 
the steady-state voltage measured in response to a hyperpolarizing test pulse of 
500-ms duration at a holding potential of —60 mV. The membrane time con- 
stant was calculated by fitting the decay of the test pulse with a single exponential 
(y=y0 + Ae -*°/7)). The membrane potential values stated in the text are not 
corrected for liquid junction potentials. The sEPSC frequency before and after 
ChR2 stimulation was calculated from 6-8 repetitions per cell. Failures of light- 
evoked synaptic transmission were defined as a peak amplitude of less than the 
mean current baseline +2s.d. in a time window defined by the onset of the mean 
evoked synaptic current +5 ms. Quantal content calculated by the direct method 
was obtained by dividing the peak amplitude of the evoked current by the peak 
amplitude of the sEPSCs in the same cell (which is not significantly different from 
the mEPSC amplitude, see Extended Data Fig. 8f-h), and the Poisson estimation 
was calculated as In(failure rate)~! (refs 39,40). The paired-pulse ratio was calcu- 
lated as the ratio of peak amplitudes between the second and first EPSCs in a train. 
Effects of drug application were calculated after a perfusion time of at least 10 min. 
Statistical analysis was performed on cells pooled across mice. 

Single unit recordings. Data acquisition. Neuropixels silicon probes (phase3, 
option1, 384 channels*!) were used to record extracellular spikes from dmSC neurons 
in three male adult C57BL/6] wild-type mice. A craniotomy was made over the SC 
and sealed with Kwik-Cast, followed by attachment of a metal custom-made head- 
plate and ground pin to the skull, using dental cement. At least 36 h after surgery, 
mice were placed on a plastic wheel and head-fixed at an angle of 30° from the 
anterior-posterior axis, parallel to an LCD monitor (Dell, 60-Hz refresh rate) centred 
30 cm above the head. Before recording, the probe was coated with Dil (1 mM in 
ethanol, Invitrogen) for track identification and a wire was connected to the ground 
pin for external reference and ground. For recording, the probe was slowly inserted 
into the SC (AP: —0.5 to —0.7, ML: 0.4 to 0.8) to a depth of 2.8-3.0 mm and left in 
place for at least 20 min before the beginning of the recording session. Data was 
acquired using spikeGLX (https://github.com/billkarsh/SpikeGLX, Janelia Research 
Campus), high-pass filtered (300 Hz), amplified (500 x), and sampled at 30 kHz. 
Sensory stimuli were delivered and synchronized using custom-made LabVIEW 
software. Mantis software (mantis64.com) and a PCle-6353 board (National 
Instruments). Visual and auditory stimuli (98% contrast; 50% contrast; sound) 
were presented interleaved with a 1-min interval and a total of 30 presentations each. 
Data analysis. Analysis was performed in MATLAB 2017a. Raw voltage traces 
were band-pass filtered (300-5,000 Hz), spikes were detected and sorted auto- 
matically using JRCLUST”, followed by manual curation. Only units with a clear 
absolute refractory period in the auto-correlogram were classified as single units. 
Firing-rate histograms were calculated as the average firing rate in bins of 1 ms 
for 30 consecutive trials, and subsequently smoothed. Units were considered to 
respond to the threat stimulus if their firing rate increased by at least 1 Hz ina 
500-ms time-window from stimulus onset when compared to the baseline (500 ms 
before stimulus onset). Peak firing rates for each stimulus were calculated as the 
mean of a 30-ms time window centred on the time of the average peak firing rate 
of all responding units. Responses to 50% contrast visual stimuli were calculated 
on all units that responded to 98% contrast. For units showing persistent activity 
after stimulus offset, the time constant to decay to baseline was obtained by fitting 
a single exponential to the average firing rate histogram. Statistical analysis was 
performed on single units pooled from all mice. 

Retrograde tracing. For monosynaptic rabies tracing’? from the dPAG, TVA and 
RG were injected unilaterally into the dPAG* with an angled approach from the 
contralateral hemisphere to avoid infection of the SC in the target hemisphere (20°, 
AP: —0.45 to —0.5, ML: —0.6, DV: —2.2). EnvA-dG-RV-mCherry was injected 
into the dPAG vertically (AP: —0.4, ML: +0.5, DV: —2.1) 10-14 days later. Mice 
were perfused seven days post-rabies virus injection. Brains were cut at 100-1m 
thickness on a microtome (HM650V, Microm). All sections containing the PAG 
and SC were mounted in SlowFade Gold, and imaged using a confocal microscope 
(SP8, Leica). Tile scans of the entire section were acquired with a 25 x water objec- 
tive (Olympus) at five depths (10 j1m apart) and maximum projections of these 
stacks were used for subsequent analysis. Cell counting was done manually (Cell 
counter plug-in, Fiji) in reference to the Franklin and Paxinos atlas*®. To quantify 
the position of presynaptic SC cells along the mediolateral axis, the coordinates 
of the counted cells were normalized to the medial and lateral extents of the SC 
for each brain slice, and a kernel density estimation was performed (Scikit-learn, 
Python). For retrograde tracing from the dmSC, rAAV2-retro-mCherry was 
injected unilaterally. AAV2-CamkII-GFP was co-injected to label the injection 
site in two out of three brains. Mice were euthanized 14-18 days afterwards and 
their brains processed as described above. Additionally, rabies tracing from the 
mSC was performed in three mice, and as described above. Every third section 
along the rostrocaudal axis of the SC was imaged with on an Axio Imager 2 (Zeiss) 
and presynaptic cells in the dPAG and auditory cortex were counted manually. 
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Histological quantifications. To estimate the fraction of VGluT2° cells in a target 
area that were infected with viral vectors, we compared the density of infected cells 
in VGluT2-Cre mice at the implant site, to control densities quantified using the 
VGluT2::eYFP reporter line. Optogenetic vectors infected 86 + 6% for dPAG and 
95+9% for mSC; GCaMP6s infected 90 + 8% for dPAG and 86 + 1% for mSC; 
hM4D infected 93 + 15% for mSC. The placement of optic fibres, GRIN lenses and 
cannulae was assessed histologically based on their tract and tip location, and their 
tip locations are illustrated in the respective sections of the mouse brain atlas*® (see 
Extended Data Figs. 2, 4, 6 and 10). 

General data analysis. Data analysis was performed using custom-written rou- 
tines in Python 2.7 and custom code will be made available on request. Data are 
reported as mean + s.e.m. unless otherwise indicated. Statistical comparisons 
using the significance tests stated in the main text were made in SciPy Stats and 
GraphPad Prism, and statistical significance was considered when P < 0.05. Data 
were tested for normality with the Shapiro- Wilk test, and a parametric test used 
if the data were normally distributed, and a non-parametric otherwise, as detailed 
in the text next to each comparison. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. The datasets generated and/or analysed in this study are available 
from the corresponding author upon reasonable request. 
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Extended Data Fig. 1 | Behaviour metrics computed over single mice. 

a-c, Summary plots for escape behaviour metrics calculated for each 

mouse individually and averaged. Plots on the left were obtained with 

data from all trials, and in the plots on the right, trials for each contrast 

were split in half and the behaviour metrics calculated for each half. There 

is a significant dependency on contrast for all metrics (reaction time, 

a: P=3.5 x 10-8; escape probability, b: P=2.1 x 10~7; escape vigour, 

c: P=1.6 x 10~°, repeated measures ANOVA), and no significant 
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difference between the metrics calculated using the first and second half 
of the trials (P > 0.4 for a main effect of trial group in all comparisons, 
two-way repeated measures ANOVA), indicating that behavioural 
performance was stable across repeated presentations of the stimulus. 
Error bars and shaded areas are s.e.m. d, Escape probability after the first 
(as shown in Fig. le, calculated by pooling all data) and fifth spot, during 
the presentation of five consecutive expanding spots. 
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from 6 dPAG and 3 dmSC cells. c, For the 18% of trials in which VGluT2* permission from Elsevier. 
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Extended Data Fig. 3 | Muscimol inactivation of dPAG and mSC 
abolishes escape while V1 and amygdala have a modulatory effect on 
escape behaviour. a, Top, example images of muscimol infusion in the 
dPAG (left) and mSC (right), and respective speed traces in response to 

a threatening visual stimulus (bottom) showing a switch from escape to 
freezing after dPAG inactivation and a loss of defensive responses after 
mSC muscimol inactivation. b, Summary quantification of the effect of 
muscimol infusion on threat-evoked defensive behaviour probability 

in the dPAG (left; 1 =7 mice, P=0.0001 for escape and P= 0.00025 for 
freezing, U-tests) and mSC (right; n = 10 mice, P=0.00021 for escape 
and P=0.051 for freezing, U-tests). c, Top, images of bilateral muscimol 
infusion in the amygdala (left) and visual cortex area V1 (right). 
Respective speed traces during threatening visual stimulus presentation 
(bottom) show that mice still engage in escape behaviour, but with reduced 
vigour. d, Summary quantification for escape probability (left) and vigour 
(right) after amygdala and V1 acute inactivation (amygdala: n = 4 mice, 
P=0.37 for escape probability, U-test; P=0.01 for escape vigour, 
two-tailed t-test; V1: n =4 mice, P=0.5 for escape probability, U-test; 
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P=0.01 for escape vigour, two-tailed t-test). e, Example speed traces 
showing that vehicle infusion in the mSC and dPAG does not change 
threat-evoked escape probability, and respective summary quantification. 
f, Infusion of mSC and dPAG with vehicle does not affect escape 
probability (mSC; n =5 mice, P=0.21, U-test; dPAG; n=5 mice, P=0.21, 
U-test). g, Infusion of mSC and dPAG with muscimol or vehicle does 

not affect running speed during exploratory behaviour (mSC: P=0.8 

for vehicle, P= 0.22 for muscimol; dPAG: P= 0.28 for vehicle, P=0.75 
for muscimol, paired t-tests). h, Profile of exploratory behaviour for 
behavioural sessions lasting at least 40 min, after injection of vehicle 

or muscimol in the mSC and dPAG. The displacement over time for all 
conditions is not significantly different to the profile for multiple trials of 
visual threat stimulation in control conditions (dashed black line, same 
data as shown in Extended Data Fig. 5e; P > 0.1 for all comparisons with 
control, two-tailed t-test). Thin lines show individual mice and thick lines 
show the dataset mean. Box-and-whisker plots show median, IQR and 
range. 
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Extended Data Fig. 4 | See next page for caption. 
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Extended Data Fig. 4 | The reliability and fraction of active cells is 
stable over multiple trials of calcium imaging, activity in the dmSC 
does not reflect head rotation and rises with different slopes, and 

dPAG activity is specific to escape. a, b, Example images of GCaMP6s 
expression in VGluT2° cells (green), with schematic showing GRIN lens 
placement in the dPAG (a) and dmSC (b). ¢, d, Raster plots showing active 
(colour squares) and non-active cells (black squares) in a single FOV 
imaged over multiple trials. A total of 8 FOVs were imaged in the dPAG (c) 
with a mean of 18 cells per FOV (range = 7-30) and 11 trials per FOV; 

and in the dmSC (d), 11 FOVs were imaged with a mean of 20 cells per 
FOV (range = 7-31) and 20 trials per FOV. There was a mean of 7 escape- 
responding cells per dPAG FOV and 16 escape-responding cells per dmSC 
FOV. e, f, Reliability of escape-responding cells showing a response over 
multiple trials for all trials (left) and for the first and second half of trials 
separately (right). Mean reliability across all trials was 28 + 3% for dPAG 
and 35 +3% for dmSC, and stable over multiple trials (P = 0.44 for dPAG, 
P=0.11 for dmSC, comparison between the two groups of trials, U-test). 
g, h, Fraction of all cells in a FOV that were active on each trial for all 
trials (left) and for the first and second half of trials separately (right). 

The active fraction across all trials was 14 + 3% for dPAG and 23 + 6% 

for dmSC, and stable over multiple trials (P = 0.21 for dPAG, P= 0.08 for 
dmSC, comparison between the two groups of trials, U-test). i, Correlation 
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between the rise slope of the population activity and escape latency (n=75 
trials, P= 0.0048, Pearson's r). j, Average population calcium signal in 

the dmSC for escape trials in response to 98% contrast spots and sound 
stimuli. The slope of the signal rise is steeper for sound-evoked escape. 

k, Left, ROC AUC for the dmSC signal before spontaneous escape onset 
after conditioning (AUC at escape onset = 0.74, significantly above chance 
2.1 s before escape, n = 57 trials). Right, average population calcium 

signal in the dmSC during threat-evoked escape trials where the mouse 
was already facing the shelter and therefore did not rotate the head (n=5 
trials). 1, Summary quantification of dPAG population calcium signals 
during threat-evoked escape and spontaneous foraging running bouts of 
similar speed (top; n = 6 escape trials and n = 6 running bouts, speed not 
significantly different, P= 0.64, t-test), showing that activity increase in 
the dPAG is specific for escape (bottom; P= 0.0018, t-test). Shaded areas 
show s.e.m., box-and-whisker plots show median, IQR and range. 

m, Correlation between the population activity of dPAG (top; n = 39 trials, 
P=6.7 x 107’, Pearson's r) and dmSC (bottom; n= 64 trials, P= 0.04, 
Pearson's r) and escape speed. Each data point is a single trial. 

n, Placement of GRIN lenses in the dmSC (magenta circles) and dPAG 
(blue circles), coordinates are in mm and from bregma. Mouse brain 
images adapted from ref. “° and reproduced with permission from Elsevier. 
Box-and-whisker plots show median, IQR and range. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Baseline period (8 min) 


Test period (30 min) 


ra, 


Nees 


Threat zone 
STZ NG 
ry Wh 


Set 
J 


<< 
os 


Control 
+S 


<A 


0 
Probability (10°) 


Probability (10°) 


b c 
505 _ = 20 
E a 
— ~ i 
x oO 
3° a el 
5 @ 5 = 2 
8 37°" 8 8 i 2 
= s ®& 404 _ § 
- 21 — 3 8 ie 
_— fe} ££ 
E |g 8 0.12 
e465 5 0.5 
| Q 
i 
a 
0 aH 8 0 = 
2 0. 
6 
0.30 60 


Conditioned Escape 


oO . 
c at 
S 3 ei 
~ 3 0: 
= 2 aH 
i= =O = > 0: 
ORE 2 2 = 40 Bi 
© a 8 Ss Bi 
a 3 3 Ei 
2 6. 0.15 i Q Fi 
£ g rot 7) H 
2 ® = 4 
5 ta 6 2 20 
iS wi 3) Q 
fo} = 
oO +r 2s 
s | 
== = 
B 
———1 + 0 
0 5 t : 0.0 
Probability (10°) Probability (10) 
e f 
8 8-5 
No stimulation Aversive conditioning 
<7 oe © 77 
E Multiple contrast stimulation £ 
— 6 — 64 
5 5 : stimulation start 5 
£ : £ 
8 4 8 
& & 
& 3 a 
me] mo} 
8 2 8 
| = 
fe) i) 
= 1 = 
0 0 ~ T T r 1 
0 10 20 30 40 


Time (min) 


Extended Data Fig. 5 | Repeated high-contrast visual stimulation causes 
place aversion, reduction in exploration and spontaneous escape. 

a, Traces and probability distributions for the location of two example 
mice during free exploration (top), and before and after a high-contrast 
visual stimulation conditioning paradigm (bottom), showing avoidance 

of the threat area after conditioning (bottom right). b, Time spent in the 
threat area decreases with aversive conditioning (35.1 + 3.5% for naive 
mice versus 5.1 + 2.0% after conditioning, n =7 mice, P=2.2 x 107°, 
two-tailed t-test). c, The frequency of visits to the threat area by the 

mice decreases significantly after conditioning (1.51 + 0.10 visits per 

min for naive mice versus 0.30 + 0.12 after conditioning, n =7 mice, 

P=1 x 10, two-tailed t-test). d, Summary quantification of spontaneous 
escape probability (left) and single trial speed traces from three mice 
(right) showing spontaneous escape after conditioning (P.pontaneous escape 


Time (min) 


3.2 £ 0.8% for naive mice, n=7 mice, and 12.2 + 2% after conditioning, 
n= 13 mice; P=0.004, two-tailed t-test). e, Profile of exploratory 
behaviour during behavioural sessions of multiple contrast stimulation 
(black, data taken from the mice that generated the dataset for Fig. 1) with 
no stimulation for comparison (orange). Exploration decays over time and 
the decay is accelerated by visual stimulation, but the two curves are not 
significantly different over time (2.4+ 0.3 m min! at 40 min for control 
versus 2.0 + 0.3 with visual stimulation, P= 0.16, two-tailed t-test). 

f, Same quantification as in e for sessions of aversive conditioning. 
Aversive conditioning significantly reduces exploratory behaviour 
(1.2£0.3 m min“! after conditioning, P= 0.018 versus no stimulation 
and P=0.039 versus multiple contrast stimulation, two-tailed t-test). Thin 
lines show individual mice monitored for 40 min and thick lines show the 
dataset mean. Box-and-whisker plots show median, IQR and range. 
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Extended Data Fig. 6 | See next page for caption. 
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Extended Data Fig. 6 | Optogenetic activation of dPAG and mSC elicits 
escape over a range of frequencies, and mSC VGluT2::ChR2-evoked 
escape is abolished by inactivating the PAG, but not the PBGN. 

a, Optic-fibre placements for ChR2 stimulation in the dmSC (magenta 
circles) and dPAG (blue circles), coordinates are in mm and from bregma. 
Mouse brain images adapted from Franklin and Paxinos** and reproduced 
with permission from Elsevier. b, Example speed traces for dPAG (left) 
and mSC (right) ChR2 stimulation at different frequencies (10 pulses) 
and high light intensities, showing robust escape behaviour for 5 to 40 Hz 
stimulation. c, Left, speed traces for 473-nm light stimulation (40 Hz, 

30 pulses) of one mouse expressing eYFP in the dPAG (dark green), 
showing no change in running speed. Light green traces show similar 
speed profiles for the same mouse entering the stimulation area with the 
light off. Blue dashed traces are from a different mouse expressing ChR2 
in the dPAG (40 Hz, 10 pulses), for comparison. Right, summary data 


for eYFP control stimulation in dPAG (running speed not significantly 
different between laser on and off, n = 236 trials from 3 mice, P= 0.48, 
U-test). d, Image showing expression of ChR2-eYFP in the mSC (green) 
with projections to the PBGN (yellow) and muscimol infusion (orange). e, 
Speed traces for spot-evoked escape responses from one mouse before and 
after acute PBGN inactivation. f, Summary data for escape probability and 
vigour during mSC optogenetic stimulation and PBGN acute inactivation, 
showing no difference (n =3 mice, P= 0.80 for escape probability; 
P=0.70 for escape vigour, U-test). g, Image showing expression of ChR2- 
eYFP in the mSC (green) and muscimol infusion in the PAG (orange). 

h, i, Speed traces (h) and summary data (i) showing that mSC ChR2- 
evoked escape is abolished by PAG acute inactivation (n =3 mice, 
P=0.0297 for probability, U-test). Box-and-whisker plots show median, 
IQR and range. 
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Extended Data Fig. 7 | See next page for caption. 
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Extended Data Fig. 7 | dPAG neurons receive input from mainly 
excitatory cells in the SC and do not project back to the SC. a, Image 
showing starter dPAG VGlut2* cells expressing both TVA-GFP and 
RV-mCherry and presynaptic cells expressing RV-mCherry only (left), 
and corresponding schematic (right) illustrating the position of starter 
dPAG (blue) and presynaptic SC cells (pink) across deep, intermediate 
and superficial SC layers (same as shown in Fig. 4a). b, Kernel density 
estimation curves for the axial position of presynaptic SC cells for each 
layer (82.9 + 2.6% of 1,770 cells are located within the medial bisection 
of ipsilateral SC, n = 3 mice). c, Image showing presynaptic cells in the 
mSC infected with rabies virus (red) from starter neurons in the dPAG of 
a VGluT2::eYFP mouse (left). Box indicates area magnified shown on the 
right. Yellow cells are VGluT2* mSC presynaptic neurons. d, Summary 


quantification of the percentage of presynaptic cells in the mSC that 
express VGluT2* (mean = 87.9 + 1.0%, n=4 mice). e, Image showing 
injection of rAAV2-retro in the mSC (left) and no retrogradely labelled 
cells in the dPAG (bottom, left), while retrograde labelling is present in 
the auditory cortex for comparison (bottom, right). Similarly, rabies virus 
injected in the mSC shows a lack of presynaptic cells in the dPAG (right), 
suggesting a predominantly feed-forward connectivity arrangement 
between the mSC and dPAG (note, however, that it cannot be excluded 
that both rAAV2-retro and rabies display selective tropism that prevents 
labelling of dPAG neurons). f, Summary quantification for retrogradely 
labelled cells in the dPAG and auditory cortex after mSC rAAV2-retro 
(n=3 mice) or rabies infection (n =3 mice). Box-and-whisker plots show 
median, IQR and range. 
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Extended Data Fig. 8 | Biophysical properties of excitatory dPAG 
neurons and synaptic properties of the dmSC-dPAG excitatory 
connection. a, Example trace of current step injections in a VGluT2* 
dPAG cell (left) and summary current-frequency relationship (right, 
shaded area is s.e.m.). b, Summary quantification of resting membrane 
potential (mean = —61.4+ 2.15), input resistance (mean = 0.55 + 0.05 GQ) 
and membrane time constant (mean = 28.3 +3 ms) for VGluT2+ dPAG 
cells (n= 14 cells, n=7 mice). c, Example current traces for one dPAG 
VGluT2* cell showing optogenetically evoked EPSCs from the dmSC 
(left) that are blocked by TTX (middle) and recovered by 4-AP (right), 
confirming the presence of a monosynaptic connection. d, Summary 
data for peak dmSC-dPAG EPSC amplitudes and connectivity rate in the 


presence of TTX and 4-AP. e, Summary data showing that the properties 
of the dmSC-dPAG connection do not change with number of days after 
viral transfection of ChR2, and remain weak and unreliable (n = 15 mice, 
P=0.78, 0.51 and 0.33 for amplitude, failure rate and connectivity rate, 
respectively, Kruskal-Wallis test). Plots show mean and s.e.m. f, Average 
waveforms for sEPSCs and mESPCs (recorded in TTX) in one cell, and 
respective cumulative histogram for peak amplitudes. g, Peak amplitude 
of sEPSCs and mEPSCs is not significantly different (n = 4 cells, P= 0.18, 
0.79, 0.9 and 0.36 respectively, Kolmogorov—Smirnov test for 100 events 
in each condition per cell). Box-and-whisker plots show median, IQR and 
range. 
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Extended Data Fig. 9 | Silicon probe anatomical placement and Left, raw voltage trace from the channel with the strongest signal for the 
examples of dmSC single units. a, Example image showing the track left unit of interest (black symbols below indicate all spikes detected for the 
by one probe stained with Dil, superimposed on a bright-field image of unit). Middle, auto-correlogram of spike times calculated in bins of 
a 30-\.m sagittal slice. b, Schematic illustrating the probe track in each 1/30 ms. Right, superimposed action potential waveforms chosen 
mouse (sagittal section, 0.6 mm lateral to the midline). Mouse brainimage randomly from the whole recording (light colour) and average waveform 
adapted from Franklin and Paxinos“ and reproduced with permission (dark colour). 


from Elsevier. c, Two examples of dmSC single units (top and bottom). 
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Extended Data Fig. 10 | Controls and cannulae placements for 
chemogenetic inactivation experiments. a, Summary in vitro data 

for hM4D-neurexin/ChR2-expressing VGluT2* dmSC neurons before 
(baseline) and after CNO application (CNO), showing no effect of CNO 
on action potential firing in response to current injection (left, n= 6 
cells, P= 0.8738 for main effect of CNO, two-way repeated measured 
ANOVA; inset shows example traces to two current steps) or to 473-nm 
light-evoked ChR2 activation (right, n =9 cells, P= 0.7006 for main 
effect of CNO, two-way repeated measured ANOVA). Error bars are 
s.e.m. b, Application of CNO reduces dmSC-dPAG excitatory synaptic 
transmission by 71 +7% (n= 10 cells, P=6.19 x 10~°, two-tailed t-test 
between baseline and CNO). c, Disrupting mSC-dPAG synapses with 
CNO microinfusion in behaving mice blocks visually evoked escape 


@ CNO (ChR2 stimulation) 
@ CNO (Spot stimulation) 


behaviour (n =3 mice, P= 0.036, U-test). d, Doubling the intensity 

or frequency of mSC stimulation while locally blocking mSC-dPAG 
synapses is not sufficient to rescue escape behaviour (n=5 mice, P=0.11 
for intensity, U-test; P= 0.42 for frequency, U-test; both comparisons 
against escape probability after local block in baseline conditions shown 
in Fig. 41). e, Cannula placements for local inactivation experiments with 
CNO at the SC-PAG synapse (left) and at the SC-LP synapse (right). The 
tip of the internal cannulae is indicated by yellow circles (for experiments 
with optogenetic stimulation of dmSC VGluT2* cells) and brown circles 
(for experiments with visual stimulation). Coordinates are in mm and 
from bregma. Mouse brain images adapted from Franklin and Paxinos*® 
and reproduced with permission from Elsevier. Box-and-whisker plots 
show median, IQR and range. 
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Malignancy is accompanied by changes in the metabolism of 
both cells and the organism!. Pancreatic ductal adenocarcinoma 
(PDAC) is associated with wasting of peripheral tissues, a metabolic 
syndrome that lowers quality of life and has been proposed to 
decrease survival of patients with cancer**. Tissue wasting is a 
multifactorial disease and targeting specific circulating factors to 
reverse this syndrome has been mostly ineffective in the clinic”. 
Here we show that loss of both adipose and muscle tissue occurs 
early in the development of pancreatic cancer. Using mouse models 
of PDAC, we show that tumour growth in the pancreas but not in 
other sites leads to adipose tissue wasting, suggesting that tumour 
growth within the pancreatic environment contributes to this 
wasting phenotype. We find that decreased exocrine pancreatic 
function is a driver of adipose tissue loss and that replacement 
of pancreatic enzymes attenuates PDAC-associated wasting of 
peripheral tissues. Paradoxically, reversal of adipose tissue loss 
impairs survival in mice with PDAC. When analysing patients with 
PDAC, we find that depletion of adipose and skeletal muscle tissues 
at the time of diagnosis is common, but is not associated with worse 
survival. Taken together, these results provide an explanation for 
wasting of adipose tissue in early PDAC and suggest that early loss of 
peripheral tissue associated with pancreatic cancer may not impair 
survival. 

Models of autochthonous mice with PDAC recapitulate many 
features of human disease, including cachexia”’®. We confirmed the 
occurrence of severe adipose tissue and skeletal muscle wasting in 
advanced disease using two autochthonous mouse models with 
Kras(G12D) activation and loss of Trp53 function—either via Trp53 
deletion (KP~/~C) or mutant Trp53™ 7H expression (KPC) (Fig. la-f). 
Because protein breakdown in tissues occurs early in PDAC?, we 
assessed the kinetics of tissue wasting in the KP~/~C model and 
found a decrease in the mass of the adipose tissue by six weeks of age, 
whereas the weight of the pancreatic tissue was unchanged and plasma 
branched-chain amino acids (BCAAs), a measurement of peripheral 
tissue wasting’, were modestly increased (Fig. 1g-i and Extended Data 
Fig. 1a). Adipose tissue wasting was greater than skeletal muscle wast- 
ing throughout disease progression (Fig. 1g, h), consistent with adipose 
tissue depletion occurring before skeletal muscle loss'®"'. 

Histological analysis of gastrocnemius skeletal muscle in 6-week-old 
KP-/~C mice (early KP-/C) suggests muscle atrophy as characterized 
by a decrease in the cross-sectional area of myofibres (Extended Data 
Fig. 1b, c). Using micro-computed tomography imaging of the lower 
leg muscles (gastrocnemius skeletal muscle and soleus), we found a 
decreased muscle volume in early KP~/~C mice (Extended Data 


Fig. 1d, e). We also observed decreased muscle mass and elevated 
expression of genes involved in autophagy and ubiquitin—-proteosome 
degradation, suggesting activation of muscle breakdown (Extended 
Data Fig. 1f, g). Histological analysis of adipose tissue also revealed 
smaller adipocytes and enhanced lipolysis in early KP~/~C mice 
(Extended Data Fig. 1h-l), suggestive of a reduction in triglyceride syn- 
thesis or triglyceride storage. These results are consistent with enhanced 
lipolysis!*!° and muscle proteolysis'+ in advanced cancer; however, in 
KP~!~C mice these changes occur by six weeks of age, suggesting that 
peripheral tissue loss is initiated early in PDAC. 

Although the KP~/~C model has predictable disease progression’, 
there is widespread pancreatic transformation and cancer progresses 
in young mice such that tissue depletion could involve a failure to 
gain mass. To determine whether peripheral tissue wasting occurs in 
other PDAC models with a longer time until disease onset, we assessed 
adipose and skeletal muscle mass in mice in which oncogenic Kras@!?? 
alone is activated in the pancreas (KC). KC mice develop premalig- 
nant lesions that progress to invasive tumours with advanced age’. 
In 15-week-old KC mice, an age preceding invasive cancer, we also 
observed decreased adipose and skeletal muscle mass (Fig. 1j-] and 
Extended Data Fig. 1m). These data indicate that peripheral tissue 
wasting in PDAC is initiated early and before frank cancer onset. 

To test whether a PDAC-derived systemic factor mediates early 
tissue wasting in PDAC, we implanted PDAC cells isolated from 
autochthonous tumours either subcutaneously or orthotopically into 
the pancreas of syngeneic mice (Fig. 2a). When implanted at either 
location, solitary tumours form with similar histological features 
(Fig. 2b). Although mice were injected with the same number of cells, 
only the mice bearing orthotopic pancreatic tumours developed 
marked adipose wasting (Fig. 2c, d). Subcutaneous tumours are defi- 
cient in stromal components and may display reduced desmoplasia, 
although similar trichrome staining was observed for subcutaneous 
and orthotopic tumours, suggesting similar fibrosis (Fig. 2b). We 
isolated and immortalized pancreatic stellate cells (PSCs) from syn- 
geneic mice’® and injected PDAC cells alone or together with PSCs 
to form subcutaneous or pancreatic orthotopic tumours and tested 
whether this major component of the stroma contributes to periph- 
eral tissue loss. Although we found that addition of PSCs did not alter 
peripheral tissue wasting, the location of the tumour still contrib- 
uted to tissue wasting (Fig. 2e, f). We next analysed secreted factors 
that have been reported to be increased in late-stage PDAC and/or 
known to induce cachexia in other models*!® and found no differ- 
ences between control and early KP~/~C mice (Extended Data Fig. 2). 
These results indicate that tumour growth in the pancreas, rather than 
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Fig. 1 | Early PDAC is associated with 
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a known systemic circulating PDAC-derived factor, promotes early 
wasting of the adipose tissue. 

To investigate how tumour growth in the pancreas promotes adipose 
tissue wasting, we assessed systemic O2 consumption, CO production 
and calculated the respiratory exchange ratio. Both control and early 
KP~/~C mice displayed similar respiratory exchange ratios (Fig. 2g), 
arguing against a shift in whole-body fuel source utilization. However, 
O, consumption and CO, production were lower in early KP~/~C 
mice (Fig. 2h, i), suggesting decreased nutrient oxidation. Because 
food intake was similar between groups (Fig. 2j), early KP ~'—C mice 
may metabolize less food and altered pancreatic function could explain 
these findings as well as the loss of adipose tissue. 

To assess endocrine pancreatic function, we measured plasma 
glucagon and insulin. Whereas we found no significant differences in 
glucagon levels (Fig. 3a), early KP-/~C mice had lower levels of insulin 
in the fed state (Fig. 3b). To test for an insulin secretion defect, we 
measured plasma insulin after a bolus glucose injection and found no 
significant differences (Fig. 3c). These results indicate that the func- 
tion of the endocrine pancreas was not altered in early PDAC; rather, 
impaired dietary absorption or starch breakdown may have caused the 
lower fed insulin levels. Consistent with this idea, we observed lower 
fed blood glucose levels in early KP~/~C mice (Fig. 3d). Furthermore, 
reduced insulin levels may promote increased lipolysis and adipose 
tissue loss. 

To assess exocrine pancreatic function, we measured faecal lipid 
content and found higher faecal lipids in early KP~/~C mice (Fig. 3e). 
We also found decreased faecal protease activity and increased faecal 
protein content in these mice (Fig. 3f, g). Orthotopic implantation of 
PDAC cells into the pancreas also led to increased faecal protein content 
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during disease progression (Fig. 3h). Decreased exocrine function of 
the pancreas could explain why early KP~/~C mice had decreased 
levels of O2 consumption and CO) production despite normal 
food intake, leading to a starvation-like response with mobilization 
of energy stores from peripheral tissues. Indeed, mice bearing sub- 
cutaneous PDAC tumours that were fed a calorically restricted diet 
showed increased adipose tissue loss compared to skeletal muscle loss 
and reduced tumour size (Extended Data Fig. 3a—d). 

To test whether decreased exocrine pancreatic function contributes 
to tissue wasting in PDAC, we supplemented a diet with pancreatic 
enzymes (Fig. 3i). Providing pancreatic enzymes attenuated adipose 
wasting in mice with PDAC (Fig. 3j). Furthermore, whereas mice with 
early PDAC displayed decreased fed glucose levels (Fig. 3d), mice with 
PDAC fed a diet supplemented with pancreatic enzymes displayed sim- 
ilar glucose levels to control littermates (Extended Data Fig. 3e). To 
control for potential food intake differences associated with adding 
pancreatic enzymes to a diet, we pair-fed mice to assure similar food 
consumption, and again observed attenuated adipose tissue loss in 
PDAC when providing pancreatic enzymes (Extended Data Fig. 3f-i). 
These results confirm that decreased pancreatic exocrine function 
mediates adipose tissue loss and contributes to peripheral tissue wast- 
ing in mice with early PDAC. 

Cachexia has been proposed to worsen patient survival in various 
cancers including PDAC*“. To determine whether adipose tissue wast- 
ing limits survival in PDAC, we assessed whether supplementation 
with pancreatic enzymes improved disease outcome. Despite reduced 
adipose tissue wasting, supplementation with pancreatic enzymes 
decreased survival of mice with PDAC (Fig. 3k), suggesting that periph- 
eral tissue wasting may not always limit survival. 
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Fig. 2 | Pancreatic tumour growth promotes adipose tissue wasting. 

a, Schematic of experimental design. b, Representative haematoxylin and 
eosin (H&E) and Masson’s trichrome histology of subcutaneous and 
orthotopic pancreatic tumours. n= 4. c, d, Epididymal adipose tissue and 
gastrocnemius skeletal muscle mass in mice injected with saline (PBS) or 
PDAC cells. c, Mice bearing subcutaneous allografts. n = 5 PBS-injected 
mice and 9 mice injected with PDAC cells. P=0.83. d, Mice bearing 
pancreatic orthotopic allografts. n = 5 PBS-injected mice and 7 mice 
injected with PDAC cells. P< 0.0001. e, f, Epididymal adipose tissue and 
gastrocnemius skeletal muscle mass in mice injected with PDAC cells or 
PDAC cells together with PSCs. e, Mice bearing subcutaneous allograft 


To investigate the association between peripheral tissue wasting and 
patient survival, we identified 782 patients at five US cancer centres 
with previously untreated PDAC, available clinical and outcome data, 
and banked blood samples (Extended Data Table 1). We quantified lum- 
bar visceral and subcutaneous adipose tissue areas using pretreatment 
computed tomography scans!”"'® (Extended Data Fig. 4a). Although 
adipose tissue area was associated with multiple clinical factors 
(Extended Data Table 2), no association was found between adipose 
tissue wasting and patient survival in the full population (Table 1) or 
by disease stage (Extended Data Table 3a). 

Studies of muscle wasting and survival of patients with PDAC have 
led to conflicting results!”!*-*”, potentially because of differences in 
study design. We next used computed tomography imaging to measure 
the lumbar skeletal muscle index, a marker of muscle mass'”!??%?, 
Using previously established cut points for sarcopenia’*, we found that 
65% of patients displayed sarcopenia at diagnosis, and these patients 
did not have worse survival (Table 1). Notably, the prevalence of sar- 
copenia was not different between disease stages (localized, 64%; 
locally advanced, 70%; metastatic disease, 63%; P=0.40). Because an 
interaction between sarcopenia and body mass index has been pro- 
posed”®?, we evaluated whether sarcopenic obesity was associated 
with worse patient survival and observed no differences in survival 
between different patient groups (Table 1). Because optimal sarcopenia 
cut points are not well-defined", we investigated an agnostic skeletal 
muscle index classification with gender-specific quintiles and found 


Control 


Early 
KP--C 


Night 


tumours. n=6 per group. P= 0.63. f, Mice bearing pancreatic allograft 
tumours. n =6 per group. g-j, Metabolic cage measurements in male 
control (n = 8) and early KP~/~C (n=7) mice. g. Respiratory exchange 
ratio. h, Relative volumetric oxygen consumption (V,,) calculated from 
area under the curve (AUC) measurements during the day and night. 
P=0.06 (day) and P= 0.129 (night). i, Relative volumetric carbon dioxide 
release (Vcg,) calculated from AUC measurements during the day and 
night. P=0. 102 (day) and *P=0.01 (night). j, Food intake (g consumed 
per day. n=6 per group. P=0.45. Unless otherwise indicated, statistical 
analysis was performed using unpaired two-sided t-tests, data are 

mean + s.e.m. and n represents the number of mice that were analysed. 


no association with patient survival (Extended Data Table 3b). We also 
examined skeletal muscle area and attenuation and did not identify 
an association of these markers with patient survival (Extended Data 
Table 3b). Finally, because plasma BCAAs elevations reflect tissue wast- 
ing in early PDAC?, we evaluated whether plasma BCAAs at diagnosis 
were associated with reduced survival and did not identify worse out- 
comes with elevated BCAAs (Extended Data Tables 4-6 and Extended 
Data Fig. 4b). Thus, in this multi-institutional patient population with 
newly diagnosed, previously untreated PDAC, we found no evidence 
that early skeletal muscle or adipose tissue wasting was associated with 
worse survival. 

Independent of effects on survival, assessing peripheral tissue loss 
before overt disease onset may help to identify PDAC at earlier stages. 
In mouse models, we found that decreased exocrine function contrib- 
utes to adipose tissue wasting. Many patients with PDAC experience 
loss of exocrine pancreatic function”; however, whether decreased 
exocrine function contributes to wasting in patients requires further 
study. Furthermore, correction of pancreatic exocrine function in 
mouse models reduces adipose tissue wasting but does not significantly 
affect muscle loss, suggesting that supplementation with pancreatic 
enzymes is either insufficient to fully restore the nutritional state or 
that additional factors contribute to muscle wasting. These other factors 
may include stroma-derived inflammatory signals that were not tested 
in this study. Nevertheless, the findings that a starvation-like state in 
mouse models of PDAC contributes to adipose tissue loss and increases 
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Fig. 3 | Decreased exocrine pancreatic function in early PDAC disease 
promotes tissue wasting. a—d, Endocrine function measurements in 


male control and early KP~'-C mice. a, Fasted 


n= 15 control and 12 early KP~/-C mice. P=0.28. b, Insulin levels in 


overnight fasted and fed mice. n= 4 per group. 


before and after a glucose injection. n =5 control and 6 early KP~/~C 
mice. P= 0.13 (time = 0) and P= 0.16 (time = 30). d, Glucose levels in 
overnight fasted and fed mice. = 11 control and 12 early KP~/~C mice. 
*P = 0.00008. e-g, Faecal analysis of male control and early KP~/~C mice. 
e, Total faecal lipid. n =9 control and 10 early KP~/-C mice. *P=0.004. 


f, Total faecal protease activity. n = 4 per group. 


faecal protein level. n= 6 control and 9 early KP~/~C mice. *P = 0.004. 

h, Total faecal protein levels over time in male C57BL/6J mice following 
orthotopic implantation of PDAC cells into the pancreas. n =5. P=0.008 
(day 14 versus day 17) and *P =0.005 (day 14 versus day 21). i, Schematic 
of experimental design. j, Relative weights of KP~/~C male mice fed a 
control diet or a diet supplemented with pancreatic enzymes. n= 6 control 
diet and 7 supplemented diet. *P = 0.033. AT, adipose tissue; SM, skeletal 
muscle. k, Survival of male KP~/~C mice fed the indicated diet. n = 12 per 
group. Mantel-Cox test, P= 0.02. Unless otherwise indicated, statistical 
analysis was performed using unpaired two-sided t-tests, data are 

mean + s.e.m. and n represents the number of mice that were analysed. 


circulating glucagon levels. 


c, Insulin levels in mice 


. *P = 0.0009. g, Total 


Table 1 | Hazard ratios for death among cases with pancreatic cancer based on body composition 


Quintiles of body composition areas 


1 2 3 4 5 P trend* 

Visceral adipose tissue area 

umber of cases 36 38 38 138 137 
Median (cm?) 34.7 108.2 171.0 228.9 312.0 
Median overall survival (months) 2.3 0.2 11.5 11,3 11.1 
Hazard ratio (95% confidence interval) 1.0 1.25 (0.96-1.62) 1.06 (0.82-1.38) 1.08 (0.83-1.41) 1.04 (0.79-1.36) 0.73 
Hazard ratio (95% confidence interval)° 1.0 1.33 (1.01-1.74) 1.10 (0.82-1.47) 1.14 (0.84-1.56) 1.16 (0.82-1.63) 0.31 
Subcutaneous adipose tissue area 

umber of cases 36 38 37 138 137 

edian (cm?) 82.7 136.9 177.5 234.7 351.0 

edian overall survival (months) 19 0.6 2.0 11.6 10.2 
Hazard ratio (95% confidence interval)” 0 1.18 (0.91-1.53) 0.97 (0.75-1.26) 0.92 (0.71-1.19) 1.06 (0.81-1.38) 0.80 
Hazard ratio (95% confidence interval)° 1.0 1.19 (0.91-1.57) 0.99 (0.74-1.32) 1.04 (0.77-1.41) 1.25 (0.86-1.82) 0.44 
Sarcopenia‘ No Yes P value 

umber of cases® 248 462 
Median overall survival (months) 11.6 11.3 
Hazard ratio (95% confidence interval)° 0 1.03 (0.86-1.24) 0.74 
Hazard ratio (95% confidence interval)* 0 1.04 (0.85-1.27) 0.72 
Sarcopenia and obesity Neither Obese Sarcopenic Both 

umber of cases® 54 186 191 262 

edian overall survival (months) Lal 12.0 10.4 171.8 
Hazard ratio (95% confidence interval)” 0) 0.98 (0.69-1.38) 1.11 (0.79-1.57) 0.95 (0.67-1.33) 
Hazard ratio (95% confidence interval)° 1.0 0.90 (0.60-1.35) 1.12 (0.80-1.58) 0.91 (0.63-1.32) 


@The two-sided P trend is calculated by entering the quintile-s 


ecific median va 


ue for adipose tissue area as a continuous variable in the Cox proportional hazards model. 


’Cox proportional hazards model adjusted for age at diagnosis (continuous), gender (male or female), race (white, non-white or unknown), year of diagnosis (2000-2005, 2006-2010 or 2011-2015), 


institution (Dana-Farber/Brigham and Women’s Cancer Cente: 
advanced, metastatic or unknown). 


r, Massachusetts General Hospital, Mayo Clinic, Stanford University or University of North Carolina) and cancer stage (local, locally 


°Cox proportional hazards model additionally adjusted for body mass index (continuous), diabetes history (none, <4 years, >4 years or unknown) and smoking status (never, past, current or unknown). 


4Sarcopenia is defined as a skeletal muscle index (the ratio of 


he skeletal muscle area (in cm?) to the height squared (in m2)) of less than 55.4cm? per m? for men and less than 38.9cm? per m? for 


women. This was measured at baseline computed tomography imaging. Obesity is defined as a body mass index of more than 25 kg per m?. 


*For sarcopenia, 8 patients were excluded owing to missing information on heigl! 
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t. For sarcopenia and obesity, a further 17 patients were excluded owing to missing information on weight. 
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survival are consistent with data that suggest that caloric restriction 
improves survival of mice with PDAC™, perhaps via similar mech- 
anisms. Although not examined here, changes in insulin levels may 
also contribute to survival differences upon reversal of a starvation-like 
state. The finding that pancreatic enzyme supplementation led to worse 
survival in mice also suggests that peripheral tissue wasting in early 
PDAC may be distinct from cachexia associated with late-stage disease. 
Nutritional intervention and pancreatic enzyme replacement are some- 
times used in patients with PDAC” and a better understanding of the 
mechanisms that cause tissue wasting across cancer types and stages 
of disease is needed to design interventions that reduce functional dis- 
ability and improve survival of patients with cancer. 
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Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
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METHODS 

Cell culture. All cells were cultured in DMEM supplemented with 10% FBS 
(Corning) and 1% penicillin and streptomycin (Corning). 

Western blot analysis. Antibodies recognizing pHSL (ser563) (4139) and total 
HSL (4107) were purchased from Cell Signaling Technologies. 

Animal studies. All experiments performed in this study were approved by the 
MIT Committee on Animal Care (IACUC). Furthermore, for subcutaneous 
tumour growth, a maximum tumour burden of 2cm? was permitted by our IACUC 
protocol and these limits were not exceeded. All mice in this study were fully 
backcrossed to the C57BL/6J background, and housed under a 12-h light and 12-h 
dark cycle, and cohoused with littermates with ad libitum access to water and food, 
unless otherwise stated. Furthermore, all experimental groups were age-matched 
and assigned based on genotype (or treatment). All animals were numbered and 
experiments were conducted in a blinded manner. After data collection, genotypes 
were revealed and animals assigned to groups for analysis. All measurements were 
collected from distinct animals, n represents biologically independent samples, and 
mice were analysed as tumours developed. No statistical methods were performed 
to predetermine sample size. 

For studies using Kras®?? Trp53/“"'Pdx 1 (KP~/~C) mice, controls included 
littermates that lacked the Cre-recombinase allele, LSL-Kras®!”” allele or both. 
For studies using Kras¢!?? Trp53®!72H/* pdx 1? (KPC) mice, control mice included 
littermates that lacked the LSL-Kras©!” allele or LSL-p53'”" allele. For studies 
using Kras°!?> Pdx 1‘ (KC) mice, control mice included littermates expressing 
the Cre allele. 

For subcutaneous or pancreatic orthotopic tumours, C57BL/6J mice (000664) 
were injected with 10° mouse PDAC cells isolated from C57BL/6] KP~/~C mice 
as previously described’. Phosphate-buffered saline (PBS) or PDAC cells were 
injected into the right flank (in 100,11) or the pancreas (5011) of 8-week-old 
C57BL/6] mice. For protein faecal collection studies in mice bearing orthotopic 
pancreatic tumours, mice were injected with 10° mouse PDAC cells (5011) and 
allowed to recover from surgery for two weeks. Faecal matter was collected after 
this two-week recovery time every 2-3 days for approximately 2-3 weeks and 
placed in —20°C until the day of analysis. For studies using PSCs, we isolated 
PSCs as previously described”°. After a few passages, PSCs were immortalized and 
injected at a 1:1 ratio of PSCs to cancer cells as previously described». 

For experiments using metabolic cages, 5-6-week-old mice were placed in met- 
abolic cages and food intake, respiratory exchange ratio (RER), volumetric rate of 
O, consumption and volumetric of CO, production were measured over a three- 
day period (TSE Systems). RER was calculated using the following formula: 
RER = Voo,/Vo, 

For pairfeeding experiments, the amount of diet supplemented with pancreatic 
enzymes that was consumed per mouse per day was calculated for an average 
of five days. Animals with early PDAC were weighed, individually housed and 
randomly given a preweighed quantity of food that was either control diet or diet 
supplemented with pancreatic enzymes. Mice were weighed before the start of 
the food-pairing experiment to ensure both groups of mice had similar starting 
body weights. 

For caloric-restriction studies, mice were injected subcutaneously (into both 
flanks) with PDAC-derived cells as described above. Animals were randomly 
placed on AIN-93 G (TD.94045) control diet or on the same diet at 40% restric- 
tion after tumours were palpable. Mice were individually housed and fed daily 
either 3.2 g per day (control mice) or 1.9g per day (calorie-restricted mice) for a 
total of three weeks. 

Animal diets. For diets supplemented with pancreatic enzymes, AIN-93G pow- 
dered diet (TD.94045) was purchased from Envigo and mixed with a commercial 
preparation of pancreatic enzymes as previously described”. 

Assessment of glucose metabolism. Glucose levels were measured using 
a Breeze-2 glucose meter (Bayer). For insulin measurements, we used an 
Ultrasensitive Mouse Insulin ELISA (#90080) following the manufacturer’s pro- 
tocol. For in vivo insulin secretion, mice were fasted for 16h and intraperitoneally 
injected with glucose (1g kg~'), blood samples were collected and analysed at the 
indicated time points. 

Ex vivo lipolysis. Ex vivo lipolysis assays of adipose explants were performed as 
previously described”*. In brief, epididymal adipose tissues were collected and 
incubated at 37°C, non-esterified fatty acids were measured using Wako diagnos- 
tics and glycerol release was measured using the Free Glycerol Determination kit 
(FG0100; Sigma-Aldrich) according to the manufacturer's instructions. 
Hormone and metabolite measurements. Blood samples were collected in EDTA- 
containing tubes and centrifuged 3,000 r.p.m. for 15 min (4°C). Circulating levels 
of IL-6 (M6000B, R&D Systems), corticosterone (80556, Crystal Chem), amylin 
(EIAM-AMY-1, RayBiotech) were measured according to the manufacturer’s 
instructions. TNE, IFNy, IL-10, IL-16, IL-17 and IL-4 were measured using a 
Discovery Assay (Eve Technologies). Circulating BCAAs levels were measured 
as previously described’. 


Micro-computed tomography imaging in mice. All micro-computed tomog- 
raphy (CT) measurements in mice were performed using GE eXplore CT120. 
The scans were conducted at 70kVp, 50 mA and 32 ms. There were 720 views, 
0.5 degrees apart over a full 360-degree rotation. To assess muscle volume, a 3D 
Gaussian filter was used in an area extending from the right ankle to the proximal 
end of the fibula. This filtered dataset was used to determine areas corresponding 
to leg muscle while excluding other tissues. This was done in MATLAB using the 
connected components (bwconncomp) function. Having created the mask (seg- 
mented the leg muscle from other tissues), a histogram of the muscle region was 
calculated. Voxels falling within the density range of 160-200 Hounsfield units 
(HU) were considered muscle. This was done to correct for any overlap of the 
muscle mask with adjacent bone or adipose (which have higher and lower HU 
values, respectively). 

Faecal assays. To assess total faecal protein, 10 mg of faeces was resuspended 
in lysis buffer (2% SDS, 150 mM NaCl, 0.5M EDTA), sonicated and the protein 
concentration was assessed using a BCA assay according to the manufacturer's 
instructions. Total faecal protease activity was measured as previously described”. 
In brief, 10-30 mg of faecal matter was resuspended in 1 ml of buffer A (0.1%Triton 
X-100, 0.5 M NaCl, 100mM CaCl,), sonicated and centrifuged. The supernatant 
was then incubated with 3% Azo-Casein (Sigma-Aldrich, A2765) at 37°C for 
60 min. The reaction was stopped using 8% trichloroacetic acid and centrifuged. 
The absorbance of the supernatant (measured at 366 nm) was measured using a 
spectrophotometer. Total faecal lipids were measured as previously described”. In 
brief, 1,000 mg of faeces was collected and lipids were extracted using a 2:1 chloro- 
form:methanol solution. The lipid fraction was dried using a stream of gaseous 
nitrogen and the vials were weighed. 

Histological analysis. Tissues were fixed overnight with neutral-buffered 10% 
formalin, paraffin-embedded, sectioned and stained with haematoxylin and eosin 
or stained with Masson's trichrome using standard protocols. 

Statistics for animal data. All graphs were generated using Prism (GraphPad) 
software and data are mean + s.e.m. Unless otherwise indicated, P values were 
determined using unpaired two-sided t-tests. Statistical outliers were measured 
using a Grubb’ outlier test (Prism) and excluded from the final analysis. 

Human population study. Our study population included patients with pan- 
creatic cancer from five US cancer centres: Dana-Farber/Brigham and Women’s 
Cancer Center (DF/BWCC), Massachusetts General Hospital (MGH), Mayo 
Clinic, Stanford University and University of North Carolina-Chapel Hill (UNC). 
We included 782 patients with pancreatic adenocarcinoma who were diagnosed 
between 2000 and 2015, and had a stored plasma sample collected before receiving 
any treatment for their malignancy, including surgery, radiation or chemotherapy. 
A total of 778 patients at the five institutions met these criteria and had a plasma 
sample collected within 30 days before their pathological diagnosis and 60 days 
after this diagnosis. Of these patients, 687 had a CT scan performed during this 
time period for the analysis of body composition. The overall study was approved 
by the Dana-Farber/Harvard Cancer Center IRB, and data abstraction and blood 
sample collection was approved by each individual institutional IRB. All partici- 
pants provided informed consent. 

Human plasma samples and metabolite profiling. Blood was collected in sterile 
EDTA tubes, and processed within 3h (Dana-Farber Cancer Institute, Mayo Clinic, 
Stanford University and UNC) or 24h (MGH) for separation into plasma and other 
components. Plasma was aliquoted into cryovials and stored at —80°C. Plasma 
samples were thawed once on wet ice to aliquot into smaller sample volumes for 
analysis, de-identified and sent for analysis in a single shipment. 

Liquid chromatography tandem mass spectrometry (LC-MS) analyses were 
conducted using a Shimadzu Nexera X2 U-HPLC (Shimadzu) coupled to aQ 
Exactive hybrid quadrupole orbitrap mass spectrometer (Thermo Fisher Scientific). 
Polar metabolites were extracted from plasma (101) using 90 jul of 74.9:24.9:0.2 
v/v/v acetonitrile:methanol:formic acid containing stable isotope-labelled internal 
standards (valine-dg (Sigma-Aldrich) and phenylalanine-dg (Cambridge Isotope 
Laboratories)). The samples were centrifuged (10 min, 9,000g, 4°C) and the super- 
natants were injected directly onto a 150 x 2-mm/?, 3-1m Atlantis HILIC column 
(Waters). The column was eluted isocratically at a flow rate of 25011 min“! with 
5% mobile phase A (10mM ammonium formate and 0.1% formic acid in water) 
for 0.5 min followed by a linear gradient to 40% mobile phase B (acetonitrile with 
0.1% formic acid) over 10 min. MS analyses were carried out using electrospray 
ionization in the positive ion mode using full scan analysis over 70-800 m/z at 
70,000 resolution and 3-Hz data acquisition rate. Other MS settings were: sheath 
gas 40, sweep gas 2, spray voltage 3.5kV, capillary temperature 350°C, S-lens RF 40, 
heater temperature 300°C, microscans 1, automatic gain control target 1 x 10°, and 
maximum ion time 250 ms. Raw data were processed using TraceFinder software 
(Thermo Fisher Scientific). Metabolite identities were confirmed using authentic 
reference standards. 

To evaluate reproducibility of BCAA measurements, we included 77 blinded 
quality control plasma samples within the larger sample set, from eight plasma 
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quality control pools. The mean coefficients of variance were 7.6% for isoleu- 
cine, 8.0% for leucine and 7.3% for valine. Because blood samples from MGH 
were processed after >3h, we evaluated the reproducibility of BCAA measure- 
ments between samples processed at different time points after collection. For 
10 patients, we processed blood immediately and at 24h and measured plasma 
BCAAs. Spearman correlation coefficients for the BCAA levels at the two time 
points were 0.95 for isoleucine, 0.95 for leucine and 0.92 for valine. All blood sam- 
ples were collected into EDTA plasma tubes, except for 28 blood samples collected 
as serum from MGH patients. To evaluate BCAAs in plasma versus serum tubes, 
we measured BCAA levels for 10 patients who had simultaneous collection of 
plasma and serum. Spearman correlation coefficients for plasma BCCAs by plasma 
versus serum samples were 0.84 for isoleucine, 0.76 for leucine and 0.94 for valine. 
Given the high correlation coefficients for plasma BCAAs for both time of blood 
processing and plasma versus serum collection, we included all MGH patients in 
our study population. 

Computed tomography imaging quantification of muscle and adipose tissue 
in patients. Muscle, visceral adipose tissue and subcutaneous adipose tissue areas 
were measured through the third lumbar vertebra landmark on CT imaging as 
previously published!”"®. Scans from the DF/BWCC, MGH, Stanford University 
and UNC study sites (n = 363) were manually segmented using Slice-O-Matic 
software (v.4.3; Tomovision) by trained image analysts with final verification 
from a board-certified radiologist. All paraspinal and abdominal wall muscles 
were included in the muscle area. The Hounsfield CT attenuation scale was used 
to constrain adipose tissue and muscle selection. The Hounsfield scale is a linear 
transformation of the linear attenuation coefficient, with fixed points calibrated for 
air at —1,000 HU and water at 0 HU. All medical CT scanners are routinely cali- 
brated to this scale. Pixel attenuation constraints from —29 to 150 HU were used for 
muscle and from —180 to —30 HU for adipose tissue as previously published!”!8. 
The visceral adipose tissue compartment was defined by the peritoneum; all extra- 
peritoneal adipose tissue was included in the subcutaneous compartment. Pixel 
dimensions were extracted from scanner parameters embedded within the scan 
data; the total area was measured as the product of segmented pixel count and 
pixel area. We calculated skeletal muscle index (SMI) as the ratio of skeletal muscle 
area (cm?) to height squared (m7). Muscle attenuation was defined as the average 
Hounsfield attenuation of all pixels in the muscle area. Analyst performance was 
tested, and a test-retest coefficient of variation <1.1% was observed for all analysts 
and parameters. 

Scans from the Mayo Clinic site (n = 324) were analysed using in-house devel- 
oped software written in MATLAB (MATLAB 2015b, MathWorks) with manual 
correction by a trained image analyst. The software automatically fits three concen- 
tric closed contours at the air-skin boundary, the subcutaneous adipose-muscle 
boundary and the abdominal wall-peritoneal adipose boundary using hierarchi- 
cal morphological classification constrained by a prior probability shape model. 
These boundaries defined the same zones as were described for the Slice-O-Matic 
method. The software also automatically created masks for bone and colonic 
content and these masks were used to exclude bone and colonic content from 
being included as muscle or adipose tissue. The final boundaries were verified by 
a board-certified radiologist. Adipose and muscle areas were then calculated for 
each compartment using the same attenuation constraints as described for the 
Slice-O-Matic method. 

To ensure consistency across sites, we analysed 20 cases using both methods and 

found a coefficient of variation of 2.4%, 1.7% and 3.8% between the methods for 
the skeletal muscle, subcutaneous adipose tissue and visceral adipose tissue areas, 
respectively. The maximum observed differences were 4.1%, 3.6%, and 15.2% for 
these compartments. 
Covariate data and statistical analysis of patient data. Using patient question- 
naires and medical records, we extracted information on patient and clinical 
characteristics, including: age at diagnosis, gender, race/ethnicity, height, weight 
at blood collection, diabetes history (no diabetes, diabetes of duration <4 years, 
diabetes of duration >4 years), tobacco use (never, past, current), primary tumour 
location (head or uncinate, body, tail, other), cancer stage (local, locally advanced, 
metastatic), year of diagnosis and survival time. 

Body composition measurements and plasma BCAAs by patient characteristics 
were compared using Wilcoxon rank-sum test or Kruskal-Wallis test. A Pearson 
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correlation test was used to evaluate the linear relationship between body compo- 
sition measurements, plasma BCAAs and other patient characteristics. To evalu- 
ate the associations between patient survival, body composition measurements, 
and plasma BCAAs, we used multivariable-adjusted Cox proportional hazards 
regression and calculated hazard ratios and 95% confidence intervals. Survival 
time was calculated as the time from diagnosis to death or date of last follow-up 
if the patient was still alive. In an initial multivariate model, we adjusted for age 
at diagnosis (continuous), gender (male or female), race (white, non-white or 
unknown), year of diagnosis (2000-2005, 2006-2010 or 2011-2015), institution 
(DF/BWCC, MGH, Mayo Clinic, Stanford University or UNC), and cancer stage 
(localized, locally advanced, metastatic or unknown). In a second multivariate 
model, we additionally adjusted for body mass index (BMI) (continuous), history 
of diabetes (none, <4 years, >4 years or unknown), and smoking status (never, 
past, current or unknown), which have previously been shown to associate with 
patient survival’). 

Because cut-offs to define clinically relevant body composition categories are 
not well-defined, we also divided patients into quintiles by adipose tissue and 
muscle tissue measurements. Because the distribution of body composition meas- 
urements differed by gender (Extended Data Table 2), we created gender-specific 
quintiles for all body composition measurements. P values for linear trends were 
calculated by entering the quantile-specific median value for body composition 
measurements in the Cox proportional hazards models. 

To evaluate sarcopenia, we used the SMI criteria established by a recent interna- 
tional consensus definition of cancer cachexia!+. The SMI cut-offs were 55.4. cm? 
per m” for men and 38.9 cm? per m” for women. To evaluate sarcopenic obesity, we 
generated a combined variable, for which overweight or obesity status is defined 
as BMI > 25kg per m’, similar to a previous study in patients with PDAC”. The 
resulting four categories were overweight/obese and sarcopenic, overweight/obese 
and non-sarcopenic, non-overweight/obese and sarcopenic, non-overweight/obese 
and non-sarcopenic. For plasma BCAAs, patients were divided into quartiles, with 
quartiles 2-4 compared to quartile 1 as the reference. P values for trends were 
calculated by entering quartile-specific median values for total plasma BCAAs as 
a continuous variable in the Cox proportional hazards model. 

We performed stratified analyses by disease stage and statistical interactions 

were assessed using the Wald test of the cross-product term between stage (local- 
ized, locally advanced, metastatic or unknown) and tissue measurements (con- 
tinuous). We also calculated the hazard ratio for each institution and measured 
heterogeneity across centres using the Cochran's Q statistic**. This statistic is a 
weighted sum of squared deviations of the estimate of an individual study from the 
overall estimate obtained by meta-analysis. All analyses were performed with SAS 
9.2 statistical analysis software, and all P values are from two-sided tests. 
Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 
Data availability. Source Data for the graphical representations found in Figs. 1-3 
and Extended Data Figs. 1-3 can be found in the online version of the paper. All 
other data that support the findings of this study are available upon request from 
the corresponding author. 
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Extended Data Fig. 1 | See next page for caption. 


Extended Data Fig. 1 | PDAC is associated with adipose and 

skeletal muscle wasting. a, Circulating BCAAs (valine, leucine, and 
isoleucine) in male control (1 = 12) and early KP~/~C (n= 10) mice. 

b, Representative histology of H&E-stained gastrocnemius skeletal 
muscle of control and early KP~/~C mice. n=4 per group. c, Relative 
myofibre area in male control and early KP~/~C mice. n = 3 per group. 
*P <.0001. d, Representative 3D j1CT imaging reconstruction of soleus 
and gastrocnemius skeletal muscle (highlighted in red). e, Relative 
soleus and gastrocnemius skeletal muscle as assessed by micro-CT 

scan of control and early KP~/~C male mice. n = 10 per group. P=0.04. 
f, Skeletal muscle tissue mass of the indicated muscle groups in male 
control (m= 8) and early KP~/-C mice (n =9). *P = 0.006 (quadriceps), 
*P=0.02 (tibialis anterior), *P=0.004 (soleus). g, Relative mRNA 


expression of the indicated genes assessed by RT-qPCR. n=4 per group. 


P=0.05 (Mstn), *P=0.01 (Trim63), *P = 0.07 (Fbx032), *P = 0.00004 
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(Map 1Ic3a), *P = .006 (Gabarapl1). h, Representative histology of 
H&E-stained epididymal adipose tissue from control and early KP~/~C 
male mice. n=4 per group. i, Relative adipocyte area in male control and 
early KP-'~C mice. n=3 per group. P< 0.0001. j, Glycerol release in ex 
vivo adipose tissue explants from control and early KP~/~C male mice. 
n=7 per group. P= 0.01. k, Non-esterified fatty acid (NEFA) release in ex 
vivo adipose tissue explants from control (n = 8) and KP~/~C male (n=7) 
mice. P=0.0002. 1, Representative western blot analysis of phosphorylated 
(p)HSL and HSL expression in adipose tissue of control and early KP~/~C 
male mice. n=3 per group. m, Representative H&E histology images 

of the pancreas of 15-week-old KC male mice. n=5 per group. Unless 
otherwise indicated, statistical analysis was performed using unpaired 
two-sided t-tests, data are mean + s.e.m. and n represents the number of 
mice that were analysed. 
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Extended Data Fig. 2 | Systemic circulating factors are not altered in 
early PDAC. a-j, Circulating levels of the indicated factors in control and 
early KP~/-C mice. a, IL-6. n= 16 control and 14 early KP~/-C mice. ns, 
not significant; P= 0.45. b, TNF-a. n= 12 control and 10 early KP-/-C 
mice. P=0.45. c, PTHrP. n=7 control and 8 early KP~/~C mice. P=0.94. 
d, Corticosterone. n= 19 control and 16 early KP-'/-C mice. P=0.40. 

e, Amylin. n=7 control and 6 early KP~'-C mice. P=0.91. f, IL-18.n=11 
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control and 10 early KP~'-C mice. P=0.39. g, IL-4. n=12 control and 10 
early KP-/~C mice. P=0.56. h, IFN. 1 = 12 control and 9 early KP-/~C 
mice. P= 0.08. i, IL-10. n= 12 control and 10 early KP~/-C mice. P=0.42. 
j, IL-17. n= 12 and 10 early KP~/-C mice P= 0.29. Unless otherwise 
indicated, statistical analysis was performed using unpaired two-sided 
t-tests, data are mean + s.e.m. and n represents the number of mice that 
were analysed. 
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Extended Data Fig. 3 | Decreased exocrine pancreatic function in early 
PDAC disease promotes adipose tissue loss. a~d, C57BL/6J mice bearing 
PDAC-derived subcutaneous tumours fed a control diet or the same 

diet at 40% caloric restriction (CR) for 3 weeks. n =8 per group. a, Body 
weight. *P < 0.0001. b, Epididymal adipose tissue (eWAT) and inguinal 
adipose tissue (iWAT) mass normalized to body weight. *P < 0.0001 for 
epididymal adipose tissue and *P = 0.0034 for inguinal adipose tissue. 

c, Skeletal muscle mass of the indicated muscle groups normalized to body 
weight. d, Tumour volume normalized to body weight. *P = 0.002. n= 15 
tumours from 8 control mice (2 tumours per mouse; for one mouse one of 
the tumours did not grow) and n = 16 tumours from 8 calorie-restricted 
mice. e, Fed plasma glucose levels of 7-week-old male control and KP-/-C 


mice that were fed the indicated diets. n =4 per group. P=0.18. f, Tissue 
weights normalized to body weight in KP~/~C mice pair-fed indicated 
diets. n= 4 per group. *P=0.01. ns, P=0.53. g, h, Tumour weight 
normalized to body weight of KP~/~C mice that were fed the indicated 
diets for 1 week. g, KP~'~C mice were fed the indicated diets. n = 6 control 
and 7 enzyme-supplemented. P= 0.9. h, KP~/~C mice were pair-fed the 
indicated diets. n= 4 per group. P= 0.26. i, Representative histology of 
H&E-stained autochthonous pancreatic tumours of KP~/~C mice that 
were fed the indicated diets. n = 4 per group. Unless otherwise indicated, 
statistical analysis was performed using unpaired two-sided t-tests, data 
are mean + s.e.m. and n represents the number of mice that were analysed. 
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Extended Data Fig. 4 | Use of CT imaging to assess patient body cancer stage (local, locally advanced, metastatic or unknown), BMI 
composition and relationship between plasma BCAA levels and patient (continuous), diabetes history (none, <4 years, >4 years or unknown) 
survival by study site. a, Representative CT image used to analyse body and smoking status (never, past, current or unknown). The pooled hazard 
composition. Skeletal muscle is shown in red, intramuscular adipose ratios were calculated using the DerSimonian and Laird random-effects 
tissue is shown in green, visceral adipose tissue is shown in yellow and model. The solid squares and horizontal lines correspond to the study 
subcutaneous adipose tissue is shown in blue. b. Hazard ratios (HRs) site-specific multivariate hazard ratio and 95% confidence interval, 
and 95% confidence intervals (CI) for the association between plasma respectively. The area of the solid square reflects the study site-specific 
BCAAs and patient survival, comparing the top and bottom quartile, weight (inverse of the variance). The filled diamond represents the pooled 
calculated using Cox proportional hazards model adjusted for age at hazard ratio and 95% confidence interval. The solid vertical line indicates 
diagnosis (continuous), gender (male or female), race (white, non-white a hazard ratio of 1.0.n=778. 


or unknown), year of diagnosis (2000-2005, 2006-2010 or 2011-2015), 
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Extended Data Table 1 | Characteristics for patients with pancreatic cancer 


a. 
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Overall DF/BWCC MGH Stanford UNC Mayo 
No. of patients 782 281 58 36 57 350 
Diagnosis period, years 
2000-2005 110 (14) 25 (9) 0 (0) 14 (39) 0 (0) 71 (20) 
2006-2010 292 (37) 90 (32) 18 (31) 15 (42) 24 (42) 145 (41) 
2011-2015 380 (49) 166 (59) 40 (69) 7 (19) 33 (58) 134 (38) 
Age at diagnosis, years 66.0 (10.7) 64.3 (10.0) 69.4 (10.2) 67.9 (13.1) 68.2 (11.1) 66.2 (10.7) 
Female sex 347 (44) 133 (48) 28 (48) 11 (32) 31 (54) 144 (41) 
White race 698 (89) 252 (90) 39 (67) 24 (67) 39 (68) 344 (98) 
Body mass index, kg/m? 27.4 (5.2) 26.7 (5.2) 27.0 (5.7) 24.7 (4.7) 27.4 (4.7) 28.2 (5.0) 
Diabetes history 
No diabetes 565 (72) 194 (69) 46 (79) 28 (78) 32 (56) 265 (76) 
Diabetes <4 years 100 (13) 38 (14) 5 (9) 4 (11) 8 (14) 45 (13) 
Diabetes >4 years 54 (7) 20 (7) 4 (7) 2 (6) 7 (12) 21 (6) 
Unknown 63 (8) 29 (10) 3 (5) 2 (6) 10 (18) 19 (5) 
Tobacco use 
Never 335 (43) 127 (45) 14 (24) 18 (50) 20 (35) 156 (45) 
Past 330 (42) 121 (43) 36 (62) 13 (36) 25 (44) 135 (39) 
Current 68 (9) 30 (11) 8 (14) 4 (11) 12 (21) 14 (4) 
Missing 49 (6) 3 (1) 0 (0) 1 (3) 0 (0) 45 (13) 
Primary tumor location 
Head / Uncinate 481 (62) 161 (57) 41 (71) 24 (67) 49 (86) 206 (59) 
Body 142 (18) 64 (23) 6 (10) 8 (22) 3 (5) 61 (17) 
Tail 97 (12) 47 (17) 11 (19) 3 (8) 5 (9) 31 (9) 
Body and tail 31 (4) 5 (2) 0 (0) 1 (3) 0 (0) 25 (7) 
Head and body 22 (3) 0 (0) 0 (0) 0 (0) 0 (0) 22 (6) 
Other 9 (1) 4(1) 0 (0) 0 (0) 0 (0) 5 (1) 
Cancer stage 
Local 258 (33) 52 (19) 51 (88) 5 (14) 54 (95) 96 (27) 
Locally advanced 157 (20) 47 (17) 0 (0) 21 (58) 1 (2) 88 (25) 
Metastatic 308 (39) 175 (62) 4(7) 9 (25) 2 (4) 118 (34) 
Unknown 59 (8) 7 (2) 3 (5) 1 (3) 0 (0) 48 (14) 
Median survival, months 
Local 20.6 20.5 25.8 6.3 17.4 22.4 
Locally advanced 11.0 13.0 n/a 7.9 1.4 11.1 
Metastatic 7.0 6.9 15.5 4.9 3.9 8.2 
b. 
Full Study Blood CT Images 
Population Samples (Pt No.) 
(Pt No.) (Pt No.) 
DF/BWCC 281 280 244 
MGH 58 58 38 
Stanford 36 34 33 
UNC 57 56 48 
Mayo Clinic 350 350 324 
Total 782 778 687 
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a, Characteristics for each study site. b, Number of patients for which blood samples and CT images were available. Continuous variables are reported as mean + s.d.; categorical variables are reported 
as number (percentage) at time of diagnosis unless otherwise noted. 
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Extended Data Table 2 | Body composition characteristics for patients with pancreatic cancer 


Pt No. SMI Muscle area Muscle Subcutaneous Visceral fat 
(cm?/m’) (cm?) attenuation fat area (cm’) area (cm?) 
(HU) 
Age 
< 60 yrs 194 47.9 (9.9) 144.4 (37.4) 39.1 (9.4) 206.9 (116.6) 155.0 (107.9) 
60 - <70 yrs 238 46.6 (11.1) 134.2 (34.9) 36.9 (9.6) 214.7 (115.1) 163.2 (103.5) 
270 yrs 255 41.7 (7.7) 118.4 (28.9) 32.4 (9.4) 185.3 (94.2) 163.7 (111.0) 
P-value® 9.8x10"° 3.3x10" 3.0x10°? 0.03 0.53 
Gender 
Male 384 49.5 (8.8) 153.8 (27.6) 37.0 (9.4) 185.8 (102.2) 203.0 (111.9) 
Female 303 39.5 (8.5) 102.6 (17.8) 34.5 (10.3) 221.5 (113.8) 107.8 (72.7) 
P-value® 3.7x10°" 6.5x10° 0.004 2.3x10° 2.1x10°" 
Race 
White 621 45.0 (10.0) 131.4 (34.4) 35.6 (9.8) 201.2 (105.5) 163.9 (105.5) 
Non-White 42 46.8 (11.1) 129.7 (39.8) 37.6 (10.7) 216.4 (137.3) 117.6 (118.3) 
P-value* 0.24 0.60 0.15 0.80 0.001 
Diabetes 
No diabetes 498 44.4 (9.4) 129.4 (35.1) 37.0 (9.7) 191.4 (99.6) 148.8 (105.6) 
Diabetes <4 yrs 92 45.8 (8.5) 133.5 (33.0) 33.3.(10.1) 224.4 (112.3) 189.2 (99.0) 
Diabetes >4 yrs 48 48.0 (10.6) 144.1 (35.9) 35.1 (8.7) 216.5 (130.4) 207.2 (116.1) 
P-value* 0.01 0.01 0.004 0.03 3.2x10° 
BMI, kg/m? 
<18.5 14 35.1 (4.6) 92.4 (15.0) 44.9 (12.2) 61.3 (51.9) 24.3 (20.7) 
18.5-24.9 220 41.0 (8.0) 117.8 (31.2) 39.0 (9.2) 135.6 (64.8) 89.9 (67.2) 
>24.9-29.9 260 45.6 (8.1) 134.7 (32.1) 35.4 (8.7) 193.6 (73.3) 176.0 (91.1) 
>29.9-34.9 117 49.4 (10.2) 146.4 (36.3) 32.9 (9.7) 264.3 (83.0) 228.1 (100.2) 
>35 54 50.7 (10.0) 147.8 (35.6) 29.2 (10.9) 398.4 (121.2) 269.1 (128.0) 
P-value* 3.6x107? 7.1x10° 2.3x10" 1.0x107° 1.1x107° 
Current smoking 
No 586 45.2 (10.1) 131.3 (35.1) 35.8 (9.8) 201.6 (105.6) 163.0 (109.4) 
Yes 55 42.9 (8.6) 121.0 (29.9) 35.7 (11.1) 183.5 (112.6) 123.8 (88.4) 
P-value* 0.11 0.05 0.81 0.07 0.02 
Cancer stage 
Local 213 45.3 (10.6) 130.7 (32.9) 34.6 (9.5) 217.5 (113.3) 170.7 (110.9) 
Locally advanced 147 45.1 (9.9) 132.6 (35.6) 37.8 (10.3) 186.5 (103.3) 152.5 (99.6) 
Metastatic 279 45.3 (9.7) 131.6 (36.0) 35.9 (9.5) 190.5 (98.6) 155.6 (106.3) 
Unknown 48 43.5 (9.3) 126.9 (34.5) 35.2 (11.3) 241.0 (142.0) 176.0 (120.3) 
P-value* 1.00 0.87 0.01 0.02 0.28 
Primary tumor location 
Head/Uncinate 423 44.6 (10.2) 129.0 (33.8) 35.5 (10.1) 205.0 (110.7) 159.4 (105.6) 
Body 131 45.0 (9.0) 130.6 (32.5) 36.9 (9.2) 185.8 (94.4) 155.9 (104.3) 
Tail 78 46.5 (9.8) 137.0 (38.0) 34.5 (9.9) 213.9 (112.2) 177.9 (122.7) 
P-value® 0.22 0.26 0.10 0.17 0.51 


Body composition measurements reported as mean + s.d. 
*Two-sided P values were calculated using Wilcoxon rank-sum tests. 
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Extended Data Table 3 | Hazard ratios (with 95% confidence intervals) for death among cases of pancreatic cancer by body composition 
measurements using computed tomography 


a. 
Pt. No. Extreme Per S.D.* P-interaction® 
Quintiles*” 
Visceral fat area (cm?) 0.59 
Localized 213 = 2.02 (0.97-4.21) 1.15 (0.88-1.50) 
Locally advanced 147 0.53 (0.22-1.32) 0.81 (0.64-1.04) 
Metastatic 279 0.98 (0.54-1.76) 0.97 (0.80-1.17) 
Subcutaneous fat area (cm?) 0.72 
Localized 213 = 1.29 (0.58-2.87) 1.44 (1.05-1.98) 
Locally advanced 146 0.64 (0.26-1.60) 0.83 (0.68-1.00) 
Metastatic 279 0.99 (0.54-1.80) 0.91 (0.73-1.12) 
SMI (cm?/m?) 0.90 
Localized 210 0.66 (0.30-1.46) 0.84 (0.64-1.12) 
Locally advanced 145 0.59 (0.27-1.29) 0.87 (0.70-1.08) 
Metastatic 277 ~—- 1.54 (0.85-2.79) 1.05 (0.84-1.30) 
Muscle area (cm?) 0.47 
Localized 213 0.82 (0.34-1.98) 0.86 (0.62-1.19) 
Locally advanced 147 0.40 (0.16-1.00) 0.81 (0.62-1.06) 
Metastatic 279 = 1.56 (0.77-3.18) 1.10 (0.82-1.45) 
Muscle attenuation (HU) 0.49 
Localized 210 0.32 (0.16-0.67) 0.74 (0.60-0.90) 
Locally advanced 147 0.99 (0.50-1.96) 0.94 (0.75-1.19) 
Metastatic 275 = 1.01 (0.60-1.69) 0.96 (0.80-1.16) 
b. 
Quintiles of Body Composition Measurements P+trend’ 
1 2 3 4 5 
SMI (cm7/m”) 
Median OS (mo.) Aaa 11.0 10.8 11.6 11.7 
Hazard ratio (95% Cl)" 1.0 0.98 (0.76-1.27) 1.04 (0.80-1.36) 0.92 (0.70-1.19) 0.91 (0.68-1.21) 0.35 
Hazard ratio (95% Cl)° 1.0 1.01 (0.78-1.31) 1.04 (0.79-1.37) 0.94 (0.71-1.24) 0.93 (0.68-1.27) 0.47 
Muscle area (cm?) 
Median OS (mo.) 10.2 12.1 10.8 11.6 11.3 
Hazard ratio (95% Cl)" 1.0 0.75 (0.58-0.98) 0.90 (0.69-1.17) 0.81 (0.62-1.05) 0.84 (0.63-1.12) 0.50 
Hazard ratio (95% Cl)° 1.0 0.78 (0.60-1.02) 0.91 (0.69-1.20) 0.82 (0.62-1.10) 0.87 (0.64-1.20) 0.71 
Muscle attenuation (HU) 
Median OS (mo.) 10.2 11.2 10.8 12.7 11.7 
Hazard ratio (95% Cl)" 1.0 0.89 (0.68-1.15) 0.92 (0.70-1.20) 0.75 (0.57-0.98) 0.87 (0.65-1.15) 0.16 
Hazard ratio (95% Cl)° 1.0 0.86 (0.65-1.13) 0.89 (0.67-1.18) 0.71 (0.53-0.96) 0.80 (0.57-1.11) 0.10 


a, Patients were stratified by cancer stage at diagnosis. b, All patients. n = 687 patients. 
*Cox proportional hazards model adjusted for age at diagnosis (continuous), gender (male or female), race (white, non-white or unknown), year of diagnosis (2000-2005, 2006-2010 or 2011-2015), 
institution (DF/BWCC, MGH, Mayo Clinic, Stanford University or UNC), BMI (continuous), diabetes history (none, <4 years, >4 years or unknown) and smoking status (never, past, current or unknown). 
>Comparing the highest and lowest quintile. 
‘Two-sided P interaction calculated by Wald test of the cross-product between body component measurements (continuous) and stage (local, locally advanced, metastatic, unknown). 

4Cox proportional hazards model adjusted for age at diagnosis (continuous), gender (male or female), race (white, non-white or unknown), year of diagnosis (2000-2005, 2006-2010 or 2011-2015), 
institution (DF/BWCC, MGH, Mayo Clinic, Stanford University or UNC) and cancer stage (local, locally advanced, metastatic or unknown). 
©Cox proportional hazards model adjusted for age at diagnosis (continuous), gender (male or female), race (white, non-white or unknown), year of diagnosis (2000-2005, 2006-2010 or 2011-2015), 
institution (DF/BWCC, MGH, Mayo Clinic, Stanford University or UNC), cancer stage (local, locally advanced, metastatic or unknown), BMI (continuous), diabetes history (none, <4 years, >4 years or 
unknown) and smoking status (never, past, current or unknown). 
‘Two-sided P-trend values calculated by entering the quintile-specific median value for body composition measurements as a continuous variable in a Cox proportional hazards model. 
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Extended Data Table 4 | Plasma BCAA levels and clinical 
characteristics of pancreatic cancer cases 


PtNo. Total BCAAs 


(uM) 
Age, years 
< 60 220 351.6 (174.9) 
60 - <70 271 327.7 (104.3) 
270 287 297.7 (84.8) 
P-value* 1.3x10° 
Gender 
Male 432 346.1 (101.2) 
Female 346 295.0 (144.1) 
P-value* 3.4x10°8 
Race 
White 696 322.2 (98.3) 
Non-White 45 290.6 (122.2) 
P-value* 0.10 
Diabetes 
No diabetes 563 322.3 (129.0) 
Diabetes <4 years 99 330.9 (113.8) 
Diabetes >4 years 54 341.8 (122.7) 
P-value* 0.24 
Body-mass index, kg/m? 
<18.5 14 286.3 (120.4) 
18.5-24.9 245 316.7 (168.0) 
>24.9-29.9 295 322.6 (91.5) 
>29.9-34.9 131 337.0 (111.7) 
>35 63 335.3 (99.0) 
P-value* 0.01 
Current smoking 
No 661 323.0 (101.2) 
Yes 68 328.8 (291.0) 
P-value* 0.04 
Cancer stage 
Local 256 306.6 (89.0) 
Locally advanced 155 321.1 (96.1) 
Metastatic 308 332.9 (110.0) 
Unknown 59 348.1 (283.8) 
P-value* 0.05 
Primary tumor location 
Head/Uncinate 479 311.7 (137.8) 
Body 141 336.4 (107.1) 
Tail 96 345.5 (70.1) 
P-value® 6.6x107 


Plasma BCAA measurements reported as mean + s.d. 
*Two-sided P values calculated using Wilcoxon rank-sum test. 
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Extended Data Table 5 | Pearson correlation coefficients for BCAAs, body composition measurements and patient characteristics 


Variable BMI Age at Valine Leucine Isoleucine Total BCAA SMI Muscle area Muscle Subcutaneous Visceral fat 
diagnosis attenuation fat area area 
BMI 1.00 -0.14° 0.10° 0.06 0.01 0.06 0.43° 0.38" -0.30° 0.74" 0.58" 
Age at diagnosis 1.00 -0.19° -0.17° -0.14° -0.18" -0.29° -0.33° -0.32° -0.09° 0.03 
Valine 1.00 0.92° 0.78" 0.95" 0.22° 0.29" 0.18" 0.03 0.18" 
Leucine 1.00 0.91° 0.99" 0.17° 0.24" 0.17° -0.01 0.12° 
Isoleucine 1.00 0.93" 0.10° 0.14° 0.13° -0.01 0.07 
Total BCAA 1.00 0.17° 0.24" 0.17° 0.01 0.13° 
SMI 1.00 0.84" 0.20° 0.15° 0.37° 
Muscle area 1.00 0.23" 0.08° 0.44° 
Muscle attenuation 1.00 -0.40° -0.34" 
Subcutaneous fat area 1.00 0.43° 
Visceral fat area 1.00 


n=687 patients. 
*Two-sided P<0.05. 
>Two-sided P< 0.0001. 
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Extended Data Table 6 | Hazard ratios (with 95% confidence intervals) for death among pancreatic cancer cases by plasma BCAA levels at 
diagnosis 


Quartiles of Plasma Branched Chain Amino Acids P-trend* 
1 2 3 4 

Total BCAAs 

Median (uM) 217.7 284.3 343.3 425.0 

Range (uM) 72.9-255.7 255.8-312.8 312.9-374.3 374.5-2327.2 

Median OS (mo.) 10.6 10.8 12.4 12.1 

Hazard ratio (95% Cl)" 1.00 0.95 (0.76-1.19) 0.79 (0.62-0.99) 0.81 (0.64-1.02) 0.04 

Hazard ratio (95% Cl)° 1.00 0.96 (0.76-1.20) 0.78 (0.62-0.99) 0.82 (0.64-1.04) 0.06 
Isoleucine 

Median (uM) 43.0 54.9 66.1 86.0 

Range (uM) 12.7-49.8 49.9-60.4 60.5-73.6 73.6-824.7 

Median OS (mo.) 10.8 11 11.6 11.6 

Hazard ratio (95% Cl)° 1.0 0.89 (0.71-1.12) 0.93 (0.74-1.16) 0.89 (0.71-1.12) 0.42 

Hazard ratio (95% Cl)° 1.0 0.89 (0.70-1.12) 0.90 (0.72-1.14) 0.90 (0.71-1.14) 0.49 
Leucine 

Median (uM) 73.6 99.9 121.8 152.5 

Range (uM) 23.6-89.0 89.3-108.0 108.3-132.6 132.7-844.6 

Median OS (mo.) 10.6 11.1 12.1 11.9 

Hazard ratio (95% Cl)” 1.0 0.91 (0.73-1.14) 0.81 (0.64-1.02) 0.82 (0.65-1.04) 0.08 

Hazard ratio (95% Cl)° 1.0 0.93 (0.74-1.16) 0.82 (0.65-1.03) 0.84 (0.66-1.07) 0.12 
Valine 

Median (uM) 98.6 129.0 155.5 191.3 

Range (uM) 35.4-114.0 114.2-141.9 142.0-171.3 171.4-658.0 

Median OS (mo.) 9.9 11.5 12.4 12.1 

Hazard ratio (95% Cl)” 1.0 0.78 (0.62-0.99) 0.71 (0.56-0.90) 0.71 (0.56-0.91) 0.01 

Hazard ratio (95% Cl)° 1.0 0.79 (0.62-1.00) 0.71 (0.56-0.90) 0.74 (0.57-0.94) 0.02 


n=778 patients. 

*Two-sided P-trend calculated by entering the quartile-specific median value for individual plasma BCAAs as a continuous variable in a Cox proportional hazards model. 

’Cox proportional hazards model adjusted for age at diagnosis (continuous), gender (male or female), race (white, non-white or unknown), year of diagnosis (2000-2005, 2006-2010 or 2011-2015), 
institution (DF/BWCC, MGH, Mayo, Stanford University or UNC) and cancer stage (local, locally advanced, metastatic or unknown). 

°Cox proportional hazards model additionally adjusted for BMI (continuous), diabetes history (none, <4 years, >4 years or unknown) and smoking status (never, past, current or unknown). 
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A naturally occurring antiviral ribonucleotide 
encoded by the human genome 


Anthony S. Gizzi)°, Tyler L. Grove!>*, Jamie J. Arnold’, Joyce Jose’, Rohit K. Jangra’, Scott J. Garforth!, Quan Du, Sean M. Cahill’, 
Natalya G. Dulyaninova!, James D. Love‘, Kartik Chandran’, Anne R. Bresnick!, Craig E. Cameron? & Steven C. Almo!4* 


Viral infections continue to represent major challenges to public 
health, and an enhanced mechanistic understanding of the processes 
that contribute to viral life cycles is necessary for the development 
of new therapeutic strategies!. Viperin, a member of the radical 
S-adenosyl-L-methionine (SAM) superfamily of enzymes, is 
an interferon-inducible protein implicated in the inhibition of 
replication of a broad range of RNA and DNA viruses, including 
dengue virus, West Nile virus, hepatitis C virus, influenza A virus, 
rabies virus” and HIV**. Viperin has been suggested to elicit 
these broad antiviral activities through interactions with a large 
number of functionally unrelated host and viral proteins**. Here 
we demonstrate that viperin catalyses the conversion of cytidine 
triphosphate (CTP) to 3’-deoxy-3’,4’-didehydro-CTP (ddhCTP), a 
previously undescribed biologically relevant molecule, via a SAM- 
dependent radical mechanism. We show that mammalian cells 
expressing viperin and macrophages stimulated with IFNa produce 
substantial quantities of ddhCTP. We also establish that ddhCTP 
acts as a chain terminator for the RNA-dependent RNA polymerases 
from multiple members of the Flavivirus genus, and show that 
ddhCTP directly inhibits replication of Zika virus in vivo. These 
findings suggest a partially unifying mechanism for the broad 
antiviral effects of viperin that is based on the intrinsic enzymatic 
properties of the protein and involves the generation of a naturally 
occurring replication-chain terminator encoded by mammalian 
genomes. 

Consideration of genome context shows that in vertebrates viperin 
is always immediately adjacent to a gene annotated as cytidylate 
monophosphate kinase 2 (CMPK2) and in some organisms, such as 
Lacinutrix mariniflava, these two genes are fused (Extended Data 
Fig. 1a). These observations suggested that viperin might modify a 
nucleotide. Evaluation of approximately 200 constructs derived from 
8 mammalian species identified a Rattus norvegicus viperin (rVIP), 
including residues 51-361, which exhibited excellent properties 
in solution (Extended Data Fig. 1b, c). We screened rVIP against a 
diverse set of nucleotides and deoxynucleotides, looking for enhanced 
5’-deoxyadenosine (5’-dA) formation as an indicator of substrate acti- 
vation (See Supplementary Information for details). As with most other 
radical-SAM proteins, when provided with dithionite as an artificial 
electron donor rVIP performs reductive cleavage of SAM in the absence 
of substrates. As shown in Fig. 1a, CTP selectively activates 5’-dA pro- 
duction by approximately 130-fold relative to protein alone. Liquid chro- 
matography shows that, in addition to 5’-dA (9.1 min), another product 
is present (5.2 min) (Fig. 1b), which exhibits a UV-visible spectrum 
similar to CTP (Amax = 271 nm), indicating that the pyrimidine ring is 
not notably modified by the viperin-mediated reaction (Fig. 1c). Liquid 
chromatography coupled with mass spectrometry (LC-MS) showed the 
new compound exhibited a negative-ion mass to charge ratio (—m/z) 
of 464.1, which is 18 Da less than the —m/z of 482.1 of CTP (Fig. 1d, e). 

13C_1BC correlation spectroscopy (COSY) NMR on uniformly 
labelled 13C,!°N3-viperin product, and 1H-13C 2D heteronuclear 


single quantum coherence (2D HSQC) chemical-shift analysis and 
3!P NMR analysis on natural abundance viperin product were per- 
formed (see Supplementary Information for details, Extended Data 
Figs. 1d, 2; see Supplementary Table 1 for chemical-shift data). Taken 
together, all available NMR and mass spectrometry data are consist- 
ent with the rVIP-catalysed conversion of CTP to ddhCTP, in which 
dehydration—involving the loss of both the 4’ hydrogen and the 3’ 
hydroxyl group—occurs without rearrangement of the carbon skele- 
ton. It is notable that MoaA, which catalyses the conversion of GTP to 
(8S)-3/,8-cyclo-7,8-dihydroguanosine 5’-triphosphate”, is the enzyme 
with the highest sequence similarity to viperin for which an unambig- 
uous functional annotation exists. Additionally, the recent report of the 
structure of viperin from Mus musculus proposed that the presumptive 
catalytic site shares high amino acid conservation with MoaA and may 
operate on a nucleotide-like substrate®. These observations are consist- 
ent with the above data. 

rVIP has a Ky, of 183 + 28 tM for CTP and produces ddhCTP with 
a maximum velocity of 0.185 + 0.007 min“! (Fig. 1f). The intracellular 
concentration of CTP typically falls in the 1 mM range, which agrees 
well with the K,, obtained for rVIP for CTP’, and the rate of ddhCTP 
formation is consistent with that of other radical-SAM enzymes with 
their native substrates®. 5’-dA and ddhCTP production is tightly 
coupled, with one molecule of 5’-dA generated for every ddhCTP 
(Extended Data Fig. 3a). rVIP also produces ddhCTP when the reac- 
tion is initiated by an enzymatic reducing system, indicating that dith- 
ionite does not direct an unanticipated side reaction between rVIP and 
CTP (Extended Data Fig. 3b). 

A recent report described a radical-SAM enzyme from the ther- 
mophilic fungus Thielavia terrestris (58% sequence identity to human 
viperin) that was capable of catalysing the coupling of UDP-glucose 
and 5’-dA to generate an uncharacterized product with a m/z of 818.1?. 
In addition, a preliminary report suggested that viperin homologues 
from Methanofollis liminatans (archaea, 35% sequence identity to 
human viperin) and Trichoderma virens (fungi, 55% identity) catalyse 
the radical-based condensation of 5’-dA and isopentenyl pyrophos- 
phate to yield adenylated isopentyl pyrophosphate!”. Substrate activa- 
tion and competition assays demonstrate that mammalian viperin does 
not catalyse these transformations (Fig. 1a, Extended Data Fig. 3c-f). 
We therefore conclude that UDP-glucose and isopenteny] pyroph- 
osphate are not likely to be physiological substrates for mammalian 
viperins; although it remains a possibility that other eukaryotes use 
viperin homologues to perform distinct functions. Analogous compe- 
tition experiments demonstrated that deoxyCTP is also not a substrate 
of rVIP (Extended Data Fig. 3g). 

Incubation of rVIP with SAM and CTP deuterated at the 2’, 3’, 
4’, 5’ and 5 positions (deuCTP), increased the —m/z of 5’-dA from 
250.1 to 251.1, consistent with the transfer of one deuterium from 
deuCTP to 5’-dA. Additionally, ddhCTP from this reaction exhibited 
a —m/z of 468.1, indicating that the deuterium abstracted by 5’-dA 
did not return to the product (Extended Data Fig. 3h, i). Derivatives 


1Department of Biochemistry, Albert Einstein College of Medicine, Bronx, NY, USA. @Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA. 
3Department of Microbiology and Immunology, Albert Einstein College of Medicine, Bronx, NY, USA. “Institute for Protein Innovation, Boston, MA, USA. "These authors contributed equally: 


Anthony S. Gizzi, Tyler L. Grove. *e-mail: tyler.grove@einstein.yu.edu; steve.almo@einstein.yu.edu 


NATUR E|www.nature.com/nature 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


a 400, b 400, 5/-dA 
= 
= 3001 5 3004 
o ¢ 
£ — 
2 8 
é 2004 5 200 4 
3 1004 = 1001 
3 ddhCDP oe gghoTP 
copy 
ol 3 ) 
PRRLLLLLLL LLL LLLVRLLG EFL. 0 2 4 6 8 10 
F Rie SOHO AWWW SSNS = Retention time (min) 
of Vr 
Q 
S 
© 30 d 100, e 100; f 
=> > 0.20 ii 
3 3 ° ° 
cs cs 5 
> > 2 2 
20} ra 9 0.154 / 
oO oO 
2 = 504 = 504 we ‘ 
E 3 3 0.10 
N N 
5 5 0.05 
4 r 
o/ : 0) ty) o+——___—_, 
250 275 300 325 480 482 484 486 462 463 464 465 466 467 0 2,000 4,000 
Wavelength (nm) miz miz CTP (uM) 


Fig. 1 | Substrate specificity of viperin. a, A panel of nucleotides was 
mixed with rVIP and SAM, and the resulting 5’-dA measured. Data 

are from two independent experiments. b, High-performance liquid 
chromatography analysis showing viperin-mediated conversion of CTP 
to a new product (time = 0, blue; time = 45 min, red). c, UV-visible 
spectrum of CTP (blue) and the new product (ddhCTP, red). Absorbance 


of site-specifically deuterated CTP demonstrated that 5’-dAe 
initiates the reaction by uniquely abstracting the hydrogen from 
the 4’ position of CTP (see Supplementary Information for details, 
Extended Data Figs. 2, 4). On the basis of this observation, a provi- 
sional mechanism is outlined in Fig. 2d, in which viperin uses the 
5’-dAe to abstract the hydrogen atom at the 4’ position of the ribose 
of CTP, subsequently allowing for the loss of the 3’-hydroxyl group 
with general acid assistance. The resulting resonance-stabilized rad- 
ical cation is then reduced by one electron to yield the designated 
product. This mechanism has precedent from model studies of the 
radiolytic cleavage of single-stranded DNA, wherein generation of a 
4’-deoxyribosyl radical causes heterolytic dissociation of the 3’ phos- 
phate group, resulting in a 3’-cation-4’-yl radical’. The source of the 
additional electron needed to reduce intermediate three (or four) 
in the viperin-catalysed reaction is currently unclear. However, we 
propose that—similar to other radical-SAM enzyme reactions!?— 
the electron probably derives from a reduced 4Fe-4S cluster, which 
suggests that viperin requires two electrons to complete each turn- 
over: one to generate the 5’-dAe and another to reduce intermediate 
three (or four). 

CMPK2 is always immediately adjacent to viperin in vertebrate 
genomes and is co-transcribed with viperin during IFN stimulation’, 
suggesting a functional linkage. Human CMPK2 (hCMPK2) was 
previously reported to catalyse the ATP-dependent phosphorylation 
of the monophosphates CMP, UMP and dCMP to the corresponding 
diphosphates'*. By contrast, we find that hCMPK2 exhibits strong pref- 
erence for CDP and UDP as substrates, yielding CTP and UTP, respec- 
tively (Extended Data Fig. 5a). Notably, when provided with ddhCDP, 
hCMPK2 displayed a tenfold-lower activity for producing ddhCTP 
when compared to the rate of CTP and UTP formation (Extended Data 
Fig. 5a, Supplementary Table 2). Therefore, on the basis of the synteny 
and coordinated expression of viperin and CMPK2, and the availa- 
ble biochemical data, we propose that CMPK2 primarily functions to 
ensure sufficient substrate (that is, CTP produced from CDP, or by 
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maximum at 271 nm (dotted line). d, e, Mass spectrometry for CTP 

(d, —m/z = 482.1) and ddhCTP (e, —m/z = 464.1). All results have been 
repeated at least three times. f, Kinetic analysis of rVIP with CTP: Ky for 
CTP = 182.8 + 27.6 1M and Vinax = 0.185 + 0.007 min! (V/Er, initial 
velocity of the reaction divided by the total enzyme concentration). 

Data are mean + s.d. from three independent experiments. 
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Fig. 2 | Proposed mechanism for formation of ddhCTP. a, m/z of 

CTP (blue) or 4’-°H -CTP (red). b, Mass spectrometry of ddhCTP from 
reactions with either CTP (blue), or 4-H -CTP (red) and rVIP. Deuterium 
from 4’-7H -CTP is not retained in ddhCTP as products have the same 
-m/z 464.1. c, 5’-dA derived from 4’-7H -CTP (red trace) increases by one 
mass unit owing to the incorporation of deuterium. These experiments 
have been repeated at least three times with similar results. d, After 
hydrogen atom abstraction at the 4’ position of CTP, general base-assisted 
loss of the 3’ hydroxyl group leads to a carbocation/radical intermediate 
that is reduced by one electron to yield the ddhCTP product. 
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mean + s.d. from three biologically independent samples; P value is from 
a two-way ANOVA with Tukey’s post hoc test. NS, not significant. 
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CTP synthetase acting on UTP) for viperin-mediated production of 
ddhCTP during viral infection. 

To demonstrate that ddhCTP can be produced in mammalian 
cytosol, we generated a series of human viperin (hVIP) (83% identi- 
cal to rVIP) and hCMPK2 expression constructs for transient trans- 
fection in HEK293T cells (Supplemenary Table 3, Extended Data 
Fig. 5b). As HEK293T cells do not express viperin in the presence or 
absence of IFN!°, ddhCTP production would not be expected in the 
absence of exogenous viperin expression. HEK293T cells were trans- 
fected with a control plasmid, hVIP alone, hCMPK2 alone or both 
hVIP and hCMPK2, and collected at defined times for LC-MS anal- 
ysis (see Supplementary Information for details). In all cases, over a 
72-h time course, the overall nucleotide pool consistently decreases, 
probably owing to limiting nutrient levels, though the overall growth 
and cell viability are not affected (Supplementary Table 4). In addi- 
tion, at each time point there are no statistically significant differences 
in total nucleotide concentrations between the control, hVIP, and 
hVIP and hCMPK2 treated cells (Extended Data Fig. 6a—d). Notably, 
HEK293T cells transfected with control plasmid exhibited ddhCTP 
levels below our limit of detection of approximately 400 fmol (Fig. 3a, 
right), whereas HEK293T cells transfected with the hVIP plasmid 
exhibited high intracellular ddhCTP levels (approximately 75 |1M 
at 16 h post-transfection) (Fig. 3a, left), which decreases to approxi- 
mately 35 |.M after 72 h. Cotransfection with hVIP and hCMPK2 plas- 
mids resulted in an approximately fourfold increase in the amount of 
ddhCTP to approximately 330 .M at 48 h (Fig. 3a, middle, purple, 
P< 0.0001). Moreover, coexpression of hVIP and hCMPK2 causes the 
ratio of ddhCTP to CTP concentrations to increase over time, whereas 
overexpression of hVIP alone results in a constant ratio of ddhCTP to 
CTP (Extended Data Fig. 6e). This behaviour may enable viperin to 
continue generating ddhCTP even though roughly 30% of the total 
cellular pool of cytidine triphosphates is present as ddhCTP at 48 h. 
These observations demonstrate that in vivo viperin is essential for pro- 
duction of ddhCTP, and suggest that CMPK2 may function to ensure 
that CTP is not limiting in the presence of viperin. 

It is well documented that viperin expression can be robustly induced 
in immune cells by interferon, lipopolysaccharide and double-stranded 
RNA analogues!®, Therefore, we cultured immortalized murine mac- 
rophages (RAW 264.7) in the presence or absence of IFNa in serum- 
free medium, as it has previously been shown that RAW 264.7 cells 
express viperin in an IFNa-sensitive fashion”. When collected after 
19 h, the concentration of ddhCTP was shown to be highly dependent 
on the concentration of IFNa (Extended Data Fig. 7a). RAW264.7 cells 
cultured in the presence of 250 ng ml~! IFNa generated intracellular 
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concentrations of ddhCTP reaching nearly 350 |1M, whereas the intra- 
cellular concentrations of ATP, UTP and CTP were unaltered (Extended 
Data Fig. 7b). Analogous to the behaviour observed in HEK293T cells 
cotransfected with hVIP and hCMPK2, treatment of RAW264.7 cells 
with 250 ng ml~! IFNa resulted in ddhCTP representing a sizable pro- 
portion (that is, 30%) of the total cytidine triphosphate pool (ddhCTP 
(approximately 350 |1M) to CTP (approximately 800 {1M)), whereas the 
level of CTP remained unchanged. This behaviour suggests that the 
viperin-mediated inhibition of viral replication is not a consequence 
of the limitation of the available pool of intracellular CTP, or other 
nucleotides, but rather is dependent on the generation of relevant con- 
centrations of ddhCTP. 

Because members of the flavivirus family are known to be sensitive 
to the catalytic activity of viperin'”, and because of the resemblance of 
ddhCTP to known polymerase chain terminators, we examined the 
effect of ddhCTP on dengue virus RNA-dependent RNA polymerase 
(RdRp) activity. First, we demonstrated that ddhCTP is a substrate for 
dengue virus RdRp using a primed-template assay!* (Extended Data 
Fig. 8a, b). Addition of CTP, 3'-dCTP or ddhCTP led to incorporation 
of all of these nucleotides (Extended Data Fig. 8b). As expected, addi- 
tion of UTP to the CMP-incorporated RNA led to further extension to 
the end of template (Extended Data Fig. 8b, c). However, addition of 
UTP to the 3’-dCMP- or ddhCMP-incorporated RNA did not support 
robust extension (Extended Data Fig. 8b, c), as expected for the action 
of chain terminators. A more stringent test of the effectiveness of a 
chain terminator is direct competition with natural ribonucleotides. 
Therefore, we evaluated RNA synthesis in the presence of increasing 
concentrations of ddhCTP or 3’-dCTP (Fig. 4a). Both ddhCTP and 
3'-dCTP were incorporated and inhibited production of full-length 
RNA (Fig. 4a). Additionally, by titrating ddhCTP at different con- 
centrations of CTP we determined the relative efficiency of the use 
of ddhCTP compared to CTP for dengue virus RdRp, as well as the 
RdRp from West Nile Virus (WNV), another pathogenic flavivirus 
(Fig. 4b, Extended Data Fig. 8e-j). This analysis yielded a 135- and 
59-fold difference in the use of ddhCTP relative to CTP for dengue 
virus and WNV RdRps, respectively. We also evaluated two additional 
members of the flavivirus family, Zika virus (ZIKV) and hepatitis C 
virus RdRps. Both of these RdRps were susceptible to inhibition by 
ddhCTP utilization and chain termination (Fig. 4c, d), consistent with 
studies demonstrating the antiviral activity of viperin against these 
viruses'??!. These data suggest that the flavivirus RdRps would be sus- 
ceptible to inhibition by ddhCTP during replication (Extended Data 
Fig. 9g). Given the efficiency of use and genome size, it is calculated 
that even a probability of incorporating the ddhCTP chain terminator 
during replication of approximately 1% would result in considerable 
reduction of full-length genomes (Extended Data Fig. 9g). To deter- 
mine whether our observations with the flavivirus RdRps extend to 
RdRps from other supergroups, we evaluated members of supergroup 
I. Specifically, we used the RdRps from human rhinovirus type C (HRV- 
C) and poliovirus, which are both members of the picornavirus family 
(Extended Data Fig. 9). Direct-incorporation experiments revealed the 
use of both ddhCTP and 3’-dCTP by HRV-C RdRp (Extended Data 
Fig. 9b). However, in the presence of other rNTPs, both HRV-C and 
poliovirus RdRp were poorly inhibited by ddhCTP (Extended Data 
Fig. 9c-f), even though both are inhibited by 3’-dCTP. On the basis 
of this data, we conclude that not all RdRps are sensitive to ddhCTP, 
suggesting that different viruses will exhibit a range of susceptibilities 
to viperin expression in vivo. 

The in vitro enzymatic characterizations suggest that ddhCTP would 
be sufficient for the in vivo inhibition of viral replication. First, we 
demonstrated that synthetic ddhC nucleoside was capable of traversing 
the plasma membrane of Vero and HEK293T cells and being metab- 
olized to ddhCTP (1 mM synthetic ddhC resulted in the intracellular 
accumulation of 129 |.M and 78 1M ddh-CTP after 24h, respectively) 
(Extended Data Fig. 5c, d). Next, we used the historical African strain 
MR766 (Uganda 1947)? and two contemporary strains, PRVABC59 
(Puerto Rico; 2015)*? and R103451 (Honduras; 2016, GenBank: 
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Fig. 4 | ddhCTP inhibits flavivirus RdRps by a chain termination 
mechanism. a, Primer extension assays show that ddhCTP (0, 1, 10, 100 
and 300 1M) inhibits dengue virus (DV) RdRp. A trinucleotide is used 
to prime RNA synthesis in the presence of nucleotides («**P-NTPs) 
producing a 15-nt product that we refer to as extended primer (ext- 
primer). Additional incorporation of nucleotides leads to production 

of n+ 1 and/orn + 5 product. Experiments repeated independently 
four times with similar results. b, Plot of [incorporated ddhCMP]/ 
[incorporated CMP] versus [ddhCTP]/[CTP]. Data are fit to lines with 


KX262887), to evaluate the antiviral activity of ddhC towards ZIKV 
replication and release from Vero cells. Treatment with ddhC resulted 
in a reduction in ZIKV titres of one to two orders of magnitude, which 
was dependent on dose, multiplicity of infection (MOI), duration of 
infection and strain (Supplementary Table 5). For example, at a MOI 
of 0.1, we observed 50-200-fold reduction in viral titre for ZIKV 
MR766 at all time points (Fig. 5a), with reductions of 5-50-fold also 
observed at a MOI of 1.0 (Extended Data Fig. 10a). ZIKV R103451 
(Honduras) and ZIKV PRVABC59 (Puerto Rico) exhibited analogous 
sensitivities to ddhC treatment (Supplementary Table 5, Extended Data 
Fig. 10a—c). The reduction in viral release was not a result of ddhC 
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Fig. 5 | ddhC reduces ZIKV release in Vero cells. a, Vero cells were 


treated with increasing concentrations of ddhC for 24 h and infected with 
ZIKV MR766 (Uganda) at an MOI of 0.1. PFU, plaque-forming unit. b, For 
viability studies, Vero cells were treated with increasing concentrations of 
ddhC under the same culture conditions used for the antiviral experiment 
described in a. Data are mean + s.d. from three biologically independent 
samples; P values are from a two-way ANOVA with Dunnett’s post hoc 
analysis. 
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slopes of 0.0074 + 0.0006 (DV) and 0.017 + 0.002 (WNV). At each ratio 
of [ddhCTP]/[CTP] a total of at least three independent experiments were 
preformed, with a total sample size of 24. c, Inhibition of the RdRp from 
WNYV, ZIKV and hepatitis C virus (HCV) by either ddhCTP or 3’-dCTP 
(100 1M). Experiments repeated independently four to five times with 
similar results. d, Percentage of inhibition shown. Data are mean + s.e.m. 
from four independent experiments. Gel shown in panel c was quantified 
by using ImageQuant TL software (GE). 


cytotoxicity, as incubation with 1 mM ddhC did not alter Vero cell 
viability (Fig. 5b, see Supplementary Information for details). These 
results, taken together with the in vitro enzymatic analyses, are con- 
sistent with a model in which ddhC-derived ddhCTP inhibits viral 
replication through premature chain termination of RdRp products. 
Of the hundreds of genes stimulated by IFN, most appear to function 
as negative effectors of viral activity, though their mechanisms of action 
remain to be defined. Here we propose a new paradigm for the antiviral 
function of viperin, which relies on its intrinsic catalytic activity to gen- 
erate ddhCTP, a previously undescribed replication-chain terminator. 
To our knowledge, viperin is the only human protein that produces a 
small molecule capable of directly inhibiting viral replication machinery. 
Importantly, overexpression of viperin and production of ddhCTP does 
not appear to adversely affect the growth rate or viability of HEK293T 
or Vero cells. This observation indicates that the host RNA and DNA 
polymerases are not negatively affected by ddhCTP and have developed 
protective mechanisms to exclude incorporation or excise this compound 
during nucleic-acid synthesis; mechanistic studies on the use of ddhCTP 
by host polymerases will be an important area for future investigation. 
In addition to its inhibitory effect on viral RdRp activity, it is possible 
that viperin possesses additional antiviral functions. For example, despite 
reports that HRV infection induces viperin expression, ddhCTP does 
not appear to act as an effective chain terminator for the HRV RdRp”. 
Furthermore, in the case of human cytomegalovirus, viperin expression 
results in enhanced infectivity, possibly through alterations in cellular 
metabolism and disruption of the actin cytoskeleton”. It is probable that 
different pathogens are responsive to distinct subsets of the IFN-inducible 
genes and—given the range of modulatory effects it has on viruses—that 
viperin synergizes with other host- and pathogen-encoded genes. 
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Extended Data Fig. 1 | Purification of rVIP and ddhCTP. a, Amino acid 
sequence from a Lacinutrix mariniflava fusion gene product of CMPK2- 
and a viperin-like protein. b, SDS-PAGE analysis after affinity and 
size-exclusion chromatography. The protein corresponding to amino 

acid residues 51-361 has a predicted molecular mass of 38.36 kDa. This 
construct was chosen because approximately 100 mg of protein could 

be purified from a 2-] fermentation. In addition, the protein is soluble at 
concentrations greater than 2 mM. c, UV-visible spectrum of purified 
rVIP (29.5 »M, UV 280/400 ratio of 4.2). d, Purification of ddhCTP using 
ammonium bicarbonate pH 9, with an elution gradient (dashed line) from 
0.2 M to 0.8 M over 200 column volumes. All results have been repeated at 
least three times. 
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Extended Data Fig. 2 | NMR spectroscopy of ddhCTP. a, *C-C p-p-m. correspond to the beta, alpha and gamma phosphates of ddhCTP, 
COSY spectrum of !3Cy!°N3-ddhCTP. The assignments for the observed respectively. c, 2D-HSQC NMR spectra collected on purified 1 mM 
correlations of the '¥C-connectivities are indicated with the grey dotted ddhCTP in DO. d, 2D-HSQC NMR spectra collected on 1 mM synthetic 


lines. b, 7'P NMR spectra (300 MHz) of ddhCTP (1 mM) in D,O at 300 K. ddhC in DO. All experiments have been repeated twice. 
Three resonance peaks at -19.5 (triplet), —9.5 (doublet) and -3.9 (doublet) 
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Extended Data Fig. 3 | rVIP produces a 1:1 stoichiometry of 5’-dA and 
ddhCTP and reacts specifically with CTP. a, Formation of ddhCTP (red 
squares) and 5’-dA (blue circles) from CTP and SAM in the presence of 
dithionite and 100 1M rVIP. ddhCTP is formed at roughly stoichiometric 
amounts with that of 5’-dA. Data are mean + s.d. from three replicates. 

b, Formation of ddhCTP (open triangle,) and 5’-dA (open circle) from 
CTP and SAM in the presence of the flavodoxin, flavodoxin reductase and 
NADPH using 100 .M rVIP. Data are mean + s.d. from three replicates. 
ddhCTP and 5’-dA are formed at roughly stoichiometric concentrations. 
The production of ddhCTP by this enzyme-driven reducing system 
indicates that ddhCTP formation is not the consequence of a side reaction 
with dithionite. c, High-performance liquid chromatography analysis 

(0 min, blue trace; 20 min, red trace; 12 h, green trace) showing the 
generation of a new peak at 1.55 min in the 12 h sample corresponding 

to a 5’-dA-dithionite adduct in the presence of 100 1M rVIP, 1 mM SAM 
and 10 mM IPP. d, Corresponding mass spectra in ESI negative mode of 
the peak occurring at 1.55 min in the 12 h sample. The 5’-dA-dithionite 
conjugate was calculated to have an exact mass of 315 Da and an m/z of 
314.1. These results have been repeated twice. e, The rate of 5’-dA formed 
by 100 1M rVIP in the presence of 1 mM SAM and 1 mM CTP alone 

or 1 mM CTP with 10 mM IPP or UDP-glucose. Data are mean + s.d. 
from three independent experiments. f, Mass spectrum traces of 5’-dA 
by ESI+. Reactions were conducted with 100 1M rVIP, 1 mM SAM and 
1mM deuCTP with or without 10 mM UDP-glucose. The mass spectrum 
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of 5’-dA produced during these reactions shows only the presence of 
deuterium, which derives from deuCTP, even when UDP-glucose is 
present at a tenfold higher concentration. An m/z of 252.1 represents the 
natural abundance peak of 5’-dA, an m/z of 253.1 indicating the addition 
of one deuterium. g, Mass spectrum trace showing -m/z of 5’-dA formed 
by combining 100 {1M rVIP with 1 mM deuCTP (dotted blue trace) or 

1 mM deuCTP with 1 mM deoxyCTP (red trace). The y-axis of each 
spectrum was normalized to 100% with arbitrary units (au) to enable 
direct comparison between each sample. The 5’-dA produced during 

this reaction has an m/z of 251.1, which is only consistent with rVIP 
abstracting a deuteron from deuCTP and not acting on the deoxyCTP 
(that is, lack of m/z 250.1). h, i, Mass spectrum trace showing -m/z of 
5’-dA (h) or the new product (i), formed by combining 100 .M rVIP with 
either 1 mM CTP (dotted blue trace) or 1 mM deuCTP (red trace). When 
rVIP was incubated with SAM and CTP deuterated at the 2’, 3’, 4’, 5’ and 
5 positions (deuCTP), the —m/z of 5’-dA increased from 250.1 to 251.1, 
consistent with the transfer of one deuterium from deuCTP to 5’-dAe. 
When ddhCTP from the reaction was analysed by mass spectormetry, 
the product exhibited a — m/z of 468.1, indicating that the deuterium 
abstracted by 5’-dA during catalysis did not return to the product. The 
y-axis of each spectrum was normalized to 100% with arbitrary units (au) 
to enable direct comparison between each sample. These results have been 
repeated at least twice. 
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Extended Data Fig. 4 | Viperin abstracts the 4’-H from CTP. a-d, Using and gain of a deuterium in the resulting 5’-dA (right column). The 5’-dA 
CTP with deuterium (7H denoted with a red D) incorporated at either the -m/z increases by one only in reactions containing CTP with a 4/-°H 
2'-7H (a), 3/-7H (b), 4/-7H (c) or 5’-7H) (d) (left column), we were able to (c, right column). Natural abundance peaks are denoted with dashed 
monitor the loss of deuterium from the resulting product (middle column) __ vertical lines. All experiments were repeated twice. 
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Extended Data Fig. 5 | CMPK2 phosphorylates UDP or CDP and amphipathic region (delta 1-42) and hCMPKz2, a control plasmid or cells 
synthetic ddhC can be converted to ddhCTP by cellular machinery. only. Only when the tag is on the N terminus of the full-length hVIP is 
a, Formation of trinucleotide species (UTP, CTP or ddhCTP) from mono- ddhCTP produced at detectable levels. c, ddhCTP concentrations from 
and diphosphate species (1 mM UMP, UDP, CMP, CDP and ddhCDP) HEK293T suspension cells that were incubated with synthetic ddhC (0 or 
in the presence of either ATP or GTP as the phosphate donor by 5 1M 1 mM) for 24, 48 or 72 h (see Supplementary Information for details). 
hCMPK2. b, ddhCTP formation in HEK293T cells expressing either d, ddhCTP concentrations from adherent Vero cells that were incubated 
Flag-hVIP (N- or C-terminal tags), Flag-hVIP without the N-terminal with synthetic ddhC (0, 0.3 or 1 mM) for 24, 48 or 72 h (see Supplementary 
amphipathic region (delta 1-42), hCMPK2 only, Flag-hVIP (N- or Information for details). All experiments were repeated once. 
C-terminal tags) and hCMPK2, Flag-hVIP without the N-terminal nd, not detectable. 
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Extended Data Fig. 6 | Cellular concentrations of nucleotides are 

not affected by viperin expression. a-d, HEK293T cells expressing 
Flag-hVIP (aqua), Flag-hVIP and hCMPK2 (purple) or cells only (dark 
blue). Samples were taken at 16, 24, 48 and 72 hours post infection 

(h.p.i.). Extraction performed with a mixture of acetonitrile, methanol 

and water (40:40:20 and 0.1 M formic acid). Cellular concentrations were 
determined using 13C,!5N,5-CTP, 13C9!5Njo- ATP, 13C)9Ns-GTP and 
13Cg!5N.-UTP spiked into the extraction mixture at known concentrations 
and using equations (1) and (2) in Supplementary Information. Analysis 
of nucleotides ATP (a), CTP (b), GTP (c) and UTP (d) did not show 
statistically significant differences (ns) between Flag-hVIP, Flag-hVIP and 
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hCMPK2 or cells only for any time point. Data are from three biologically 
independent samples. Statistical significance was determined using a two- 
way ANOVA (Supplementary Tables 12, 13, 14 and 15). e, Ratio of cellular 
concentrations of ddhCTP to CTP from HEK293T cells expressing Flag— 
hVIP (aqua) or Flag-hVIP and hCMPK2 (purple); samples were taken at 
16, 24, 48 and 72 h post transfection. The overall ratio of ddhCTP to CTP 
remains constant when only Flag-hVIP is expressed, but the concentration 
of ddhCTP is boosted significantly relative to CTP when both Flag-hVIP 
and hCMPK2 are co-expressed (plots are derived from data shown in c 
and Fig. 3a). 
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Extended Data Fig. 7 | Nucleotide concentrations are not affected grown in serum-free medium in the presence of increasing concentrations 
during ddhCTP production. a, b, Concentrations of ddhCTP (a) and of murine IFNa (10 ng ml~!, 50 ng ml“! and 250 ng ml7!). Data are from 
CTP, UTP and ATP (b) in immortalized macrophage cells (RAW 264.7) two biologically independent samples. 
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Extended Data Fig. 8 | ddhCTP is used as a substrate by dengue virus 
and WNV RdRp and chain terminates RNA synthesis. a, Schematic 

of primer-extension assay for evaluating dengue virus and WNV RdRp 
activity. b, Dengue virus RdRp-catalysed nucleoside incorporation 

using CTP, 3’-dCTP or ddhCTP as nucleoside triphosphate substrates. 
Some full-length product was observed in the presence of ddhCTP 
(more than 99% pure), which is due to residual contaminating CTP 

that could not be removed. c, Reaction products resolved by denaturing 
PAGE containing 40% formamide showing the trace amount of CTP 
contaminate in the ddhCTP preparation. These experiments were 
repeated independently at least four times with similar results. d, Longer 
incubation times and more dengue virus RdRp enzyme does not increase 
the yield of extended product. e, Dengue virus RdRp-catalysed nucleoside 
incorporation with increasing concentrations of ddhCTP (0, 1, 10, 100, 
200 and 750 \1M) at varying concentrations of CTP. This experiment was 
repeated independently three times with similar results. f, Plot of the 
percentage inhibition as a function of ddhCTP concentration at varying 
concentrations of CTP. Data were fit to a dose-response curve to obtain 
half-maximum inhibitory concentration (ICs9) values of ddhCTP of 

60 + 10, 120 + 20, 520 + 90 and 3,900 + 700 iM at 0.1, 1, 10 and 100 1M 
CTP, respectively. This experiment was repeated at least three times 


with similar results. The total sample size is 24. The error reported is the 
standard error from the fit of the data to a dose-response curve. g, Plot 

of ICso values as a function of CTP concentration. The data were fit to a 
line with a slope of 38 + 1 and an intercept of 91 + 25. The error reported 
is the standard error from the fit of the data to a line. h, WNV RdRp- 
catalysed nucleoside incorporation with increasing concentrations of 
ddhCTP (0, 1, 10, 100, 200 and 750 1M) at varying concentrations of CTP. 
This experiment was repeated at least three times with similar results. 

i, Plot of the percentage inhibition as a function of ddhCTP concentration 
at varying concentrations of CTP. Data were fit to a dose-response curve 
to obtain ICs9 values of ddhCTP of 20 + 10, 70 + 10, 300 + 40 and 

2,700 + 300 1M at 0.1, 1, 10 and 100 1M CTP, respectively. The total 
sample size is 24. The error reported is the standard error from the fit 

of the data to a dose-response curve. j, Plot of ICso values as a function 

of CTP concentration. The data were fit to a line with a slope of 27 + 1 
and an intercept of 31 + 8. Both of these results demonstrate that once 
ddhCMP is incorporated, it effectively terminates synthesis and that 

the small amount of extended product is from a trace amount of CTP 
contamination. The error reported is the standard error from the fit of the 
data to a line. 
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Extended Data Fig. 9 | HRV-C and poliovirus RdRp are poorly 
inhibited by ddhCTP. a, Schematic of primer extension assay for 
evaluating HRV-C RdRp activity. b, HRV-C RdRp-catalysed nucleoside 
incorporation using CTP, 3’-dCTP or ddhCTP as nucleoside triphosphate 
substrates. These experiments were repeated independently at least four 
times with similar results. c, Increasing concentrations of ddhCTP does 
not efficiently inhibit HRV-C RdRp-catalysed RNA synthesis. HRV-C 
RdRp-catalysed nucleoside incorporation in the presence of increasing 
concentrations of either ddhCTP or 3’-dCTP. These experiments were 
repeated independently at least five times with similar results. d, Plot of 
the percentage inhibition as a function of either ddhCTP or 3’-dCTP 
concentration. Data were fit to a dose-response curve to obtain ICs 9 values 
of 900 + 300 1M for ddhCTP and 5 + 1 uM for 3’-dCTP. The total sample 
size is eight. The error reported is the standard error from the fit of the 
data to a dose-response curve. e, f, HRV-C (e) and poliovirus (f) RdRp- 
catalysed nucleoside incorporation with increasing concentrations of 
ddhCTP (0, 1, 10, 100 and 200 1M) at varying concentrations of CTP. 
Reactions were performed with the trinucleotide primer, 5’‘-pGGC, and 
20-nt RNA template as described for dengue virus and WNV RdRp to 
directly compare results with HRV-C and poliovirus RdRp. At the highest 


concentration of ddhCTP, only approximately 2% inhibition was observed 
for HRV-C RdRp at 0.1 and 1 sM CTP. The ICs» values at 0.1 and 1 1M 
CTP are estimated to be approximately 10,000 and 20,000 ,.M ddhCTP, 
respectively. These values are three orders of magnitude higher than those 
obtained for dengue virus and WNV RdRp. Reactions in the presence 

of 3’-dCTP (200 1M) are shown as a control for inhibition. These 
experiments were repeated independently at least four times with similar 
results. g, Efficiency of incorporation and inhibition of viral RdRps. 
Footnotes: *Calculated for ddhCTP in direct competition with CTP 
(800 j1M) using the linear equations obtained from the fit of the data 
shown in g and j. For HRV-C, the ICs value was estimated to be two 
orders magnitude greater than that calculated for dengue virus and 
WNV RdRps as evidenced from the data shown in d. 'Calculated 

for a ddhCTP concentration of 350 \.M using the following equation: 
probability = [ddhCTP]/([ddhCTP] + ICso). ‘Calculated using the 
following equation: full — lengthgenome(%) = 100 x (1—probability) 
in which C,, is the number of cytidine residues in the viral genome with 
values of 2,200, 2,497, 1,565 and 1,737 for dengue virus, WNV, HRV-C and 
poliovirus respectively. 
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Extended Data Fig. 10 | ddhC reduces virus release of three different 
ZIKV isolates. Vero cells were treated with different concentrations of 
ddhC (0, 0.1 and 1 mM) for 24 h. After this 24-h period, the medium 

over cells was removed and cells were infected with fresh medium that 
contained the original concentrations of ddhC (0, 0.1 and 1 mM) and one 
of three strains of ZIKV; African strain MR766 (Uganda 1947), PRVABC59 
(Puerto Rico; 2015) or R103451 (Honduras; 2016, GenBank: KX262887). 
After three hours of ZIKV infection, virus inoculum was removed and cells 
were treated with fresh medium that contained the original concentrations 
of ddhC (0, 0.1 and 1 mM). Virus samples were collected and the medium 
over cells was replaced with fresh medium that contained the original 
concentrations of ddhC at 24, 48 and 72 h.p.i. Viral titres at 24, 48 and 

72 h.p.i. were determined using the plaque assay. a—c, Effect of ddhC on 
three different ZIKV isolates: MR766 (Uganda 1947) (a), PRVABC59 
(Puerto Rico; 2015) (b) or R103451 (Honduras; 2016) (c). Analysis of 
ZIKV titres indicates that 1 mM ddhC inhibits all three ZIKV isolates 
compared to 0 mM ddhC. However, reduction in virus titre is more 
prominent at 24 h.p.i. and 48 h.p.i. compared to 72 h.p.i. when using an 
MOI of 1.0. The antiviral effect of ddhC is more prominent at an MOI of 
0.1. Data are mean + s.d. from three biologically independent samples, 

P values from a two-way ANOVA with Dunnett’s post hoc analysis. 
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Cryo-EM structure of the serotonin 5-HTjg receptor 
coupled to heterotrimeric G, 


Javier Garcia-Nafria!*, Rony Nehmé!”, Patricia C. Edwards! & Christopher G. Tate!* 


G-protein-coupled receptors (GPCRs) form the largest family of 
receptors encoded by the human genome (around 800 genes). They 
transduce signals by coupling to a small number of heterotrimeric 
G proteins (16 genes encoding different «-subunits). Each human 
cell contains several GPCRs and G proteins. The structural 
determinants of coupling of G, to four different GPCRs have been 
elucidated“, but the molecular details of how the other G-protein 
classes couple to GPCRs are unknown. Here we present the cryo- 
electron microscopy structure of the serotonin 5-HTj, receptor 
(5-HT sR) bound to the agonist donitriptan and coupled to an 
engineered G, heterotrimer. In this complex, 5-HT,R is in an 
active state; the intracellular domain of the receptor is in a similar 
conformation to that observed for the 32-adrenoceptor (3,AR)? or 
the adenosine A», receptor (A24R)! in complex with G,. In contrast 
to the complexes with G,, the gap between the receptor and the GB- 
subunit in the G,-5-HT sR complex precludes molecular contacts, 
and the interface between the Ga-subunit of G, and the receptor is 
considerably smaller. These differences are likely to be caused by 
the differences in the interactions with the C terminus of the Go 
a-subunit. The molecular variations between the interfaces of G, 
and G, in complex with GPCRs may contribute substantially to both 
the specificity of coupling and the kinetics of signalling. 

Heterotrimeric G proteins can be divided into four subfamilies, G,, 
Gi/o, Gq and Gy2/13, each containing «, 3 and \-subunits. When an ago- 
nist binds to a GPCR, the receptor couples to a G protein, predominantly 
through the a-subunit (Ga); there are relatively few contacts with the 
B-subunit and none with the +-subunit. The overall structure of a-sub- 
units in the inactive GDP-bound state is highly conserved’, and they 
undergo similar conformational changes upon coupling to GPCRs’. This 
is characterized by a disorder-to-order transition of the C-terminal half 
of the a5-helix, which adopts an a-helical conformation upon bind- 
ing in the cytoplasmic cleft of an activated GPCR®. The key role of the 
o5-helix in G-protein coupling to GPCRs has been confirmed through 
mutagenesis studies, which are consistent with the a5-helix accounting 
for 70% of the interactions between Ga and 8,AR*. The amino acid 
sequences of the C terminus of the a5-helix are highly conserved within 
G-protein subfamilies, but are distinct between different subfamilies. 
The key role of this region in determining specificity is indicated by the 
ability to change the coupling specificity of a G protein by mutating the 
C-terminal region to match that ofa different G protein’. However, other 
regions of the a-subunit also contribute to specificity, and mutations 
often do not map directly to specificity when transferred from one G 
protein to another’®. Many receptors couple to more than one G protein, 
and this coupling can appear different depending upon whether cou- 
pling is measured dynamically or in an end-point assay'!. The reported 
structure of 8.AR coupled to heterotrimeric G, changed the model of 
how G proteins couple to and are activated by GPCRs*, but it does not 
address the issue of G-protein specificity. We have therefore determined 
the structure of 5-HT sR in complex with heterotrimeric G, to enable 
comparisons of receptor coupling to G, and Go. 

There are 13 GPCRs in the serotonin receptor family”, all of which 
are expressed in the central nervous system where they have key roles 


in all aspects of behaviour’. Structures of several 5-HT receptors have 
been determined in either inactive or active-intermediate states!+"!° 
5-HT,R binds the agonist donitriptan with high affinity and couples 
to Gijo!”. Go is the most abundant G protein in the brain and an engi- 
neered G,, mini-G,, was developed to form a heterotrimer with the GB 
and Gy subunits, which can bind and stabilize the agonist-activated 
5-HTgR"°. We expressed and purified these proteins and assembled 
them into a complex containing 5-HT,R, donitriptan, mini-Go, Bj 
and ‘jy, subunits (see Methods). The purified complex was vitrified 
on electron microscopy grids and the structure was determined by 
cryo-electron microscopy (cryo-EM) and single-particle analysis to 
an overall resolution of 3.8 A (Extended Data Figs. 1-4, Extended 
Data Table 1), with clear density for the majority of side chains and 
the agonist donitriptan (Fig. 1, Extended Data Fig. 2). Donitriptan 
occupies the orthosteric binding site, and the serotonin-like moiety of 
the ligand binds in a region analogous to that identified for the native 
agonists adrenaline'® and adenosine’? (Fig. 1, Extended Data Fig. 5). 
Donitriptan binds 5-HT;pR in a different mode to the ergot family of 
alkaloids, such as ergotamine and dihydroergotamine’> (Fig. 1). The 
donitriptan-binding site is formed by amino acid residues in trans- 
membrane helices 3, 5, 6 and 7 (H3, H5, H6 and H7) and extends into 
the extracellular region to make contacts with H6, H7 and extracellu- 
lar loop 2 (ECL2). Donitriptan is bound primarily by van der Waals 
contacts and limited polar interactions with Thr134**’ and Asp129°7 


Donitriptan 


5-HT,,R 
B 4 Ergotamine 
. | Donitriptan 
H7 . 
M337 
Y 


Mini-G, 


Fig. 1 | Overall cryo-EM reconstruction of the 5-HT,3R-Go 
heterotrimer complex. The density of the cryo-EM map (sharpened with 
a B factor of —200) is coloured according to the subunit. The inset shows 
the orthosteric binding pocket in 5-HT ,R (light blue) with donitriptan 
depicted as sticks (green, carbon) and its density in the cryo-EM map. 

The lower panel shows a superposition of ergotamine-bound 5-HT)gR 
(pale grey, PDB code 4IAR)'° and donitriptan-bound 5-HT)pR (pale blue). 
Donitriptan (green, carbon) and ergotamine (orange, carbon) are depicted 
as sticks. 


1MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge, CB2 OQH, UK. @These authors contributed equally: Javier Garcfa-Nafria, Rony Nehmé. *e-mail: cgt@mrc-Imb.cam.ac.uk 


NATUR E|www.nature.com/nature 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Fig. 2 | Go-coupled 5-HT,,R is in an active conformation. 

a-c, Superposition of G-protein-bound receptors: based on H3, H5 and 
H6: 5-HT),R (blue), AzaR (red)' and 8, AR (green)*. Key amino acid 
residues involved in receptor activation are displayed as sticks. d-f, 
Superposition of Gy-coupled 5-HT sR (blue) and the active-intermediate 
state of 5-HT)gR bound to ergotamine (pale grey), based on alignment 


(unless otherwise noted, superscripts indicate Ballesteros—Weinstein 
numbering for GPCRs”), similar to the binding mode of ergotamine. 
However donitriptan and ergotamine bind on opposite faces of the 
binding pocket, and different rotamers of Phe3517°° and Met337°°8 
accommodate the two ligands. The resolution at the ligand binding 
pocket is slightly lower than for the core of the complex and there are 
limitations in the interpretation of the experimental data (see Methods 
and Extended Data Fig. 2). 

The overall conformation of 5-HT sR in the cryo-EM structure is 
consistent with the receptor being in a fully active state. Superposition 
with the active states of 8.AR? and AzaR! shows a high degree of con- 
servation of the cytoplasmic region of the receptors (Fig. 2). In addition, 
rotamers of key conserved amino acid residues (Pro*™, Ile*4°, Phe®4)8 
are almost identical to those in the active-state structures, suggesting 
that 5-HT)pR is also in a fully active state. The structure of a 5-HT,R- 
BRIL fusion bound to the agonist ergotamine has been determined 
and was suggested to be in an active intermediate state owing to the 
partial movement of Hé6 and partial rotamer changes of key amino acid 
residues!°. Comparison of the G,-coupled 5-HT ,R with the ergot- 
amine-bound 5-HT)pR shows an 8 A shift of the cytoplasmic end of 
H6 (C, of Lys311), an inward shift of H7 and H8 by 2 A (C, of Glu374) 
and changes in rotamer of Arg**°, Tyr>** and Tyr’? (Fig. 2). Of note, 
the conformation of the extracellular region of the receptor that forms 
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5-HT,,R-G, 
5-HT,,R 


of the whole receptor. Conformational changes involved in receptor 
activation are highlighted (red arrows) and key residues are shown as 
sticks with the density from the cryo-EM map (mesh). a and d, view 
parallel to the plane of the membrane; b and e, enlarged view of the 
conserved core of the receptors; ¢ and f, view from the cytoplasmic face of 
the membrane. 


the orthosteric binding pocket does not exhibit any marked change in 
conformation in the transition from the active intermediate state to 
the active G-protein-coupled state. This resembles the transition of the 
A>aR”!, but different to that observed in 8,AR°, which has a different 
energy landscape’. 

The overall architecture of 5-HT)R coupled to Go is similar to that 
of the 8,AR-G, and A7,R-G, complexes?”, but there are critical dif- 
ferences in the details. The interface between 5-HT)pR and G, consists 
of 9 amino acid residues from 5-HT)gR and 13 from the a-subunit of 
Go. This compares with 24 residues from 82AR and 20 in AzaR that 
make contact with 17 and 22 residues in Gag, respectively (Fig. 3). 
The surface area of G, in contact with 5-HT)pR is 822 A2, whereas the 
areas of G, in contact with 8,AR and A»,R are 1260 A? and 1135 A?, 
respectively. All the contacts made by G, with the receptor are through 
residues in the a5-helix. By contrast, contacts made by Ga, to 8.AR 
and AR also involve regions in $1, S2-S3 and H4-S6 (Extended Data 
Fig. 6). The overall conformations of Ga; and Ga, are very similar 
when coupled to the receptors (Extended Data Fig. 7). However, align- 
ment of the cytoplasmic regions of 5-HTjR, B.AR and A24R shows 
that the a5-helix of Ga, is positioned differently within the receptor 
compared to Ga, (Fig. 3, Extended Data Fig. 7). There is a 9° or 11° 
tilt of the N-terminal end of the «5-helix away from the plane of the 
membrane when compared to G, coupled to 8,AR or AzaR, respectively 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


Fig. 3 | G, coupling to the 5-HT),R. a, C-terminal end of Ga, (yellow 
sticks) inserted into the cytoplasmic cleft of 5-HT),R (blue cartoon). 
Cryo-EM density is depicted as a mesh. b, Superposition of 5-HT3R 
(blue), AraR (red)! and 82AR (green)° based on alignment of H3, H5 

and H6. The different poses of the C termini of G, and G, coupled to the 
respective receptors are shown. c, Amino acid residues in 5-HT sR, AzaR 
and AR that make contact with the respective Ga subunits that they 
are coupled to are shown in colours that reflect biophysical properties of 
the residues. Green, hydrophobic; yellow, hydrophilic, red, acidic; blue, 


(the pivot point is in the region of 1344—N347 in Gag). The different tilt 
angles are probably the result of the different positions of the approx- 
imately eight C-terminal amino acid residues of the «5-helices 
that are located within the receptor. This region contains the major 
determinants of specificity for different G proteins’. The final four 
amino acid residues in the a5-helix of Ga, are YELL"; the 
equivalent residues in Ga, are C#-2GLY">6 (superscripts refer to 
the CGN numbering system’). These residues form a ‘wavy hook’ 
structure at the end of the a5-helix in Gas. In Gag, the 1-electrons of 
Tyr™53 form extensive contacts with the positively charged Arg>”°, 
which forms the boundary between the cytoplasmic cleft where the 
«5-helix binds and the hydrophobic core of the receptor’. Similarly, 
in Go, Cys351">3 interacts with Arg147**°, although only through 
van der Waals interactions. Therefore, the «5-helix of both G, and 
G, penetrate GPCRs to the same degree. In contrast to Gas, the single 
amino acid residue in Ga, that makes most contacts with the receptor 
is the C-terminal Tyr354">”°, the side chain of which stacks against 
Arg308°”? in 5-HT)gR and also makes a weak polar interaction with the 
same residue. In Ga,, the terminal amino acid Leu394"¢ makes only 
very few contacts with 8.AR and is disordered in the A2,R-G, struc- 
ture. In the A2,4R-mini-G, crystal structure, there are extensive contacts 
between Glu394"54 and three Arg residues in the H7/H8 region of the 
receptor”!; the equivalent residue in G, is Gly352"54, which makes 
only minor contacts with the receptor. Although it appears from the 
cryo-EM structure that all the major contacts between 5-HT1,R and 
Ga, are mediated by the a5-helix of Gay, there is weak density for H5 
and H6 of 5-HTpR that extends towards the a4 helix in Ga, (Extended 
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basic. Residues coloured in white do not make contact to the relevant 

Ga. The amino acid alignment was created in GPCRdb* and secondary 
structural elements and the Ballesteros-Weinstein numbers”” are shown. 
d, Snake plot of 5-HT,R created in GPCRdb with residues making contact 
with Ga, coloured according to their biophysical properties, coded as 

in c. Regions in grey were disordered in the cryo-EM map. e, Cartoon 

of secondary structural elements in Ga, and amino acid residues that 
make contact with 5-HTjgR are depicted and coloured according to their 
biophysical properties as in c. 


Data Fig. 1). It is known that mutations in the «4 helix can affect cou- 
pling to 5-HT,R”’, but it is unclear from the structure whether this 
is because direct contacts to the receptor are absent from the mutated 
«4 helix, or because there is a secondary effect of the mutation on the 
structure of Gao. 

The determinants of coupling specificity of G proteins are found pre- 
dominantly at the C terminus of Ga in the a5-helix and the wavy hook. 
The architecture of this region is virtually identical in Ga, and Gag, 
but the differences in amino acid sequence (Extended Data Figs 6, 8) 
result in Ga, being bulkier than Ga, in the terminal five residues 
(Extended Data Fig. 9). This may be sufficient to prevent coupling of 
G, to some G,-specific GPCRs as the narrower crevice in these GPCRs 
may exclude the bulkier C terminus of G;. Conversely, the wider crevice 
in G,-coupled receptors may allow coupling of Go, provided that there 
are suitable residues lining the crevice to form a good interface. This 
last caveat raises the problem of predicting G-protein-coupling spec- 
ificity. Although the structure and mechanism of GPCRs are highly 
conserved®4, human GPCRs show considerable sequence hetero- 
geneity; therefore, there is little or no specific amino acid conservation 
correlating with the subtype of G protein that a receptor couples to”. 
In addition, there is potential for different GPCR conformations”®, 
which suggests that the mode of G-protein coupling could be different 
between different receptors. This is the case in the complex of trans- 
ducin peptide and opsin”’, in which the 05-helix is tilted by around 30° 
in comparison to the a5-helix of Go, even though the G proteins are 
in the same family. More structures need to be determined to evaluate 
the diversity of G-protein coupling. 
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Fig. 4 | Comparison of G, and G, coupling. a, Cartoon of 82AR (green) 
coupled to G, (PDB code 3SN6)’. The «-helical domain of Ga has been 
removed for clarity. b-d, Surface-rendered views of the interface between 
receptor and G protein: BAR (green) and G, (b); AzaR! (red) and G, (c); 
5-HT ,R (blue) and G, (d). 


The specific differences in packing at the C terminus of Gy compared 
to G, have a disproportionate effect on the whole G protein owing to their 
amplification as a result of the different insertion angle of the «5-helix. 
This results in a change in the tilt of the whole G protein, which moves 
away from the plane of the membrane and results in a gap between the 
rest of the G protein and 5-HTj,R. Therefore, there are no contacts 
between 5-HT)pR and G6 subunits, and the only contacts made to Ga 
are with the a5-helix. This is in marked contrast to the relatively close 
packing of G, to both A>,R and 3,AR (Fig. 4). Given that the mechanism 
of GPCR and G-protein activation is conserved’, it is likely that the 
small interface between 5-HT),R and G, is a common feature of recep- 
tor coupling with Gi. family, and will be seen in other GPCRs that are 
activated by diffusible ligands. A likely consequence of the small interface 
in the receptor-Gj/. complex is that Gi. may have a faster rate of dissocia- 
tion than G, in the same GPCR. The kinetics of the steps in GPCR signal- 
ling pathways are thought to have a profound effect on which particular 
signalling event results from agonist binding to a receptor in a specific 
cell type®”®. A combination of structural data and kinetic analyses will 
be essential to unravel the complexities of this system. 
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METHODS 


Expression and purification of 5-HT,,R. N-terminally truncated wild-type 
human 5-HT},R (residues 34-390) was modified to contain a C-terminal his- 
tidine tag (His;o) and TEV protease cleavage site!”. The L138W*! mutation was 
introduced to increase thermostability. Recombinant baculoviruses expressing 
5-HT),R were prepared using the flashBAC ULTRA system (Oxford Expression 
Technologies). Trichoplusia ni cells (Expression Systems) were grown in suspension 
in ESF921 media (Expression Systems) to a density of 3 x 10° cells/ml, infected with 
5-HT},R baculovirus and incubated for 48 h. Cells were harvested and membranes 
prepared by two ultracentrifugation steps in 20 mM HEPES pH7.5, 1 mM EDTA, 
1 mM PMSF. Membranes were resuspended finally in 20 mM HEPES pH7.5, 
500 mM NaCl, 5 mM MgCh, 10 mM imidazole and Complete protease inhibitors 
(Roche) and flash frozen in liquid nitrogen and stored at —80 °C. 

Membranes from 2 | of cells were solubilised with 2% n-decyl-3-p-maltopyra- 
noside (DM) on ice for 30 min in the presence of 1 |.M donitriptan hydrochloride. 
The sample was clarified by ultracentrifugation and loaded onto a 5 ml Ni-NTA 
column (Generon). The column was equilibrated and sample was loaded in buffer 
A (20 mM HEPES pH 7.5, 500 mM NaCl, 1 mM MgCh, 50 mM imidazole, 1 1M 
donitriptan hydrochloride, 0.15% DM), and eluted with buffer B (20 mM HEPES 
pH 7.5, 100 mM NaCl, 1 mM MgCh, 300 mM imidazole, 1 |1M donitriptan hydro- 
chloride, 0.15% DM). The eluate was concentrated using a 50-kDa cut-off Amicon 
centrifugal ultrafiltration unit (Millipore), and exchanged into desalting buffer 
(20 mM HEPES pH 7.5, 100 mM NaCl, 1 mM MgCh, 1 \.M donitriptan hydro- 
chloride, 0.15% DM) using a PD10 column (GE Healthcare). Then, 2.5 mg TEV 
protease was added, and the sample was incubated on ice overnight. TEV protease 
was removed by negative purification on Ni?*-NTA resin. The sample was concen- 
trated to ~1 ml and loaded onto a Superdex 200 column (GE Healthcare). Peak 
fractions corresponding to monomers of receptors were pooled and concentrated. 
A typical yield was 1-2 mg pure 5-HTpR per litre of culture. 

Formation of a 5-HT,R-heterotrimeric mini-G, complex. Purified 5-HTjpR 
was mixed with a 1.2-fold molar excess of mini-G,18;7) in the presence of apyrase 
(0.2 U/ml) and the mixture was incubated on ice overnight”. The sample was 
loaded on to a Superdex 200 column. Peak fractions containing the 5-HT,gR- 
mini-G,)31j2 complex were pooled and concentrated to 4 mg/ml. 

Cryo-grid preparation and data collection. Cryo-EM grids were prepared by 
applying 3 \.l sample (at a protein concentration of 2.2 mg/ml) on glow-discharged 
holey gold grids (Quantifoil Au 1.2/1.3 300 mesh). Excess sample was removed by 
blotting with filter paper for 3-4 s before plunge-freezing in liquid ethane using 
a FEI Vitrobot Mark IV at 100% humidity and 4 °C. Images were collected on 
a FEI Titan Krios microscope at 300 kV using a Falcon III detector in electron 
counting mode and a Volta phase plate. EPU software (FEI) was used for automatic 
data collection. Data were collected in nine independent sessions to give a 
total of 5,737 movies. Each micrograph was collected as 75 movie frames at a dose 
rate of 0.5 e~/pixel/sec (0.4 e~/A” per frame) for 60 s, with a total accumulated 
dose of ~30 e~/A2. The magnification used was 75,000 x, yielding a pixel size of 
1.06 A/pixel. 

Data processing and model building. RELION-2.1 was used for all data process- 
ing?! unless otherwise specified. Since data were pooled from nine independent 
sessions we provide here the general strategy for data collection and process- 
ing, while precise particle numbers for a representative dataset are presented in 
Extended Data Fig. 3. Overall, drift, beam induced motion and dose weighting 
were corrected with MotionCor2” using 5 x 5 patches. CTF fitting and phase shift 
estimation were performed using Gctf v.0.1.06*°, which yielded the characteristic 
pattern of phase shift accumulation over time for each position. Generally, 40 
images were taken at each Volta phase plate position. Auto-picking was performed 
with a Gaussian blob as a template* which readily resulted in optimal particle 
picking. Particles were extracted in a box of 150 pixels (159 A) and inputted into a 
one or two reference-free 2D classification (if the majority of 2D classes had non- 
recognizable or low-quality features, then the selected particles belonging to quality 
classes were taken to a second round of 2D classification). An ab initio model was 
generated using 10,000 particles with RELION 2.1°° in the first data collection and 
used throughout. The resulting particles after 2D classification where then used for 
3D classification in both three and four classes simultaneously in order to check for 
consistency in 3D classification and to generate models with different numbers of 
particles. The models with the best defined features were selected for refinement 
either on their own or together with a second class from the same 3D classifica- 
tion (if more than one quality model was present). The particles that reached the 
highest resolution after gold-standard resolution estimation were saved. Particles 
obtained in a similar fashion from the different sessions were then merged and 
refined together. During refinement, the low-pass filter effect of the Wiener filter 
in the regularised likelihood optimisation algorithm was relaxed through the use 
of a regularisation parameter (T = 3). This allowed the refinement algorithm to 
consider higher spatial frequencies in the alignment of the individual particles. 
Nevertheless, both half-reconstructions were kept completely separately, and the 
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final resolution estimate (at the post-processing stage in RELION) was based on 
the standard Fourier shell correlation (FSC) between the two unfiltered half- 
reconstructions. The final model contained 730,118 particles and reached an overall 
resolution of 3.78 A with side chains visible for most of the complex (Extended 
Data Figs. 1, 2). Local resolution estimates were calculated with Resmap* showing 
a core of the protein at ~3.5 A resolution and an extracellular region of the recep- 
tor and 8+ N termini at poorer resolution with the worst regions reaching ~5 A 
(Extended Data Fig. 1). Signal subtraction of the DM micelle did not improve the 
quality of the map upon refinement. 

Model building and refinement was carried out using the CCP-EM software 

suite’’. The 5-HT),R-ergotamine crystal structure was used as a starting model 
(Protein Data Bank (PDB) accession 4IAR)!° for receptor building. 5-HT13R was 
modelled from residue L45 to R385. Although density was present from Y38 
and this region seems to adopt a similar conformation to the 5-HTj,R crystal 
structure, the poor resolution in this region prompted us to leave it unmod- 
elled. Residues R188 to V196 in the ECL2 and 1339 to C344 in ICL3 were flex- 
ible with absent or very poor map density and were therefore, not modelled. 
For the same reason residues K241 to L304 forming the large 5-HTgR ICL3 
loop were left unmodelled. Mini-G, was modelled from residue L5 to Y354 fol- 
lowing native Ga, numbering. Modifications in Ga, to obtain mini-G, are as 
described’? (Extended Data Fig. 7). Although 8 and ¥ subunits were modelled 
using the available crystal structures, poor density was found for both N ter- 
mini, with the whole of the + subunit having poor density. For this reason the 
worst regions of these subunits were modelled as poly-alanine. Initial manual 
model building was performed in Coot** following a jelly-body refinement in 
REFMAC5*’. Donitriptan coordinates and library were created with JLigand”” 
and manually fitted into the density using sphere real space refinement in 
Coot. Restraints were generated with ProSMART“! in order to maintain 
structural features in regions of poorer density. B factors were reset to 40 A? 
before refinement. The model then followed cycles of manual modifications in 
Coot and restraint refinement in REFMACS. The final model achieved good 
geometry (Extended Data Table 1) with validation of model performed in Coot, 
Molprobity”? and EMRinger*’. The goodness of fit of the model to the map was 
carried out using Phenix“ using a global model-vs-map FSC correlation (Extended 
Data Fig. 2). Overfitting in refinement was monitored throughout using FSCyork/ 
FSCres*. 
Note on limitations of the interpretation of density in the ligand binding 
pocket. The ligand binding pocket of 5-HT)gR is occupied by a single molecule 
of the agonist donitriptan. Despite the resolution varying between 3.8 A to 4.3 A 
in this region, estimated from the local resolution map (Extended Data Fig. 1), the 
density allowed modelling of the position and orientation of donitriptan and the 
majority of the amino acid side chains in the pocket. However, the resolution limits 
the accuracy of the refined coordinates and care must be taken when analysing 
the precise details of any potential interactions. The best resolution is towards the 
centre of the membrane bilayer and resolution gets worse towards the extracellular 
surface of the receptor. The ligand has been modelled using real space refinement, 
taking into account the location of nearby residues as well as using a library of 
restraints with allowed conformations of donitriptan. The density allowed mod- 
elling of the position and orientation of the donitriptan molecule, with the indole 
group buried deep in the orthosteric binding pocket and the remainder of the 
ligand protruding towards the extracellular surface. 

Clear interpretable density is found for large aromatic groups, while density 
is poorer for residues with smaller side chains such as Ser334°** and Ser212>”. 
Tyr3597*3, Phe330°°! and Phe331° are positioned around the indole group at the 
base of the pocket and Phe3517*° interacts with the donitriptan aromatic moiety 
at the most extracellular region. Met337°°* is located in a region of poor density 
and has been modelled so that is oriented away from the pocket and interacting 
with the aromatic group of donitriptan. This was concluded based on interpreta- 
tion of maps with different sharpening levels, but its rotamer cannot be assigned 
with confidence. The orientation of the primary amine on the serotonin moiety 
in donitriptan and the adjacent side chain of Asp129°** cannot be confidently 
assigned owing to poor density. However, Asp129°*? is absolutely conserved in 
all the human serotonin GPCRs and forms a hydrogen bond with ergotamine 
in the high-resolution crystal structure of 5-HT gR. We have therefore modelled 
Asp1293 in a similar rotamer to make a potential hydrogen bond with this pri- 
mary amine in donitriptan, despite the lack of density for both the primary amine 
and the carboxyl group of Asp1293*”. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. All data generated or analysed during this study are included in 
this published article and its Supplementary Information. The cryo-EM density 
map has been deposited in the Electron Microscopy Data Bank under accession 
code EMD-4358 and the coordinates have been deposited in the Protein Data Bank 
under accession number 6G79. 
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Extended Data Fig. 1 | Cryo-EM single particle reconstruction of 

the 5-HT)g3R-G, complex structure. a, Representative micrograph 
(magnification 75,000 x, defocus —0.6 1m) of the 5-HT|3R-G, complex 
collected using a Titan Krios with the Falcon III detector and Volta phase 
plate. b, Representative 2D class averages of the 5-HT),R-G, complex. 

c, FSC curve of the final reconstruction showing an overall resolution of 
3.8A using the gold-standard FSC of 0.143. Both masked and unmasked 
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FSC curves are shown to highlight the lack of masking artefacts. d, Final 
reconstruction coloured by subunit. Inset shows a magnified view of 
the weak density for ICL3. The magnified region corresponds to a map 
sharpened with B = —50 to remove noise from lower density levels. 

e, Local resolution estimation of the 5-HT,R map as calculated by 
Resmap. 
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Extended Data Fig. 2 | Cryo-EM map quality and model validation. site. d, FSC of the refined model versus the map (green curve) and 
a, Transmembrane helices of 5-HT pR. b, The «5-helix of Go. FSCwork/FSCtest validation curves (blue and red curves, respectively). 


c, Donitriptan and the neighbouring side chains in the orthosteric binding 
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Extended Data Fig. 3 | Flow chart of data processing. Micrographs were 
collected during nine sessions on the Titan Krios (either 24 h or 48 h) and 
each session was processed independently. The number of images and 
particles from one 48-h session is indicated on the flowchart as a guide. 
At the bottom of the figure, the final number of particles is shown. Each 
dataset was corrected separately for drift, beam-induced motion and 
radiation damage. After CTF estimation, particles were picked using a 
Gaussian blob and submitted to either one or two rounds of reference-free 
2D classification (see Methods). A 3D classification was performed on the 


Final model-3.8 A 
730,118 particles 


selected particles using an ab initio model generated from ten thousand 
particles. Classification was performed in parallel in three and four classes. 
The models with best features were refined on their own; if there were 

two classes of similar high quality, these were then re-refined together 

(the resolution of the models refers to the resolution after refinement 

and calculation of gold-standard FSC = 0.143). The set of particles that 
obtained the best map quality and resolution were saved and merged with 
the best particles from other datasets. A final model with 730,118 particles 
was refined and achieved a global resolution of 3.78 A. 
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a 34 SAKDYTYOQDSTSLPWKVLLVMLLALITLATTLSNAFVIATVYRTRKLHTPANYLIASLAV 93 
94 TDLLVSILVMPISTMYTVTGRWTLGOVVCDFWLSSDITCCTASIWHLCVIALDRYWAITD 153 


154 AVEYSAKRTPKRAAVMIALVWVFSISISLPPFFWROAKAEEEVSECVVNTIDHILYTVYST 213 
214 VGAFYFPTLLLIALYGRIYVEARSRILKOTPNRTGKRLTRAQLITDSPGSTSSVTSINSR 273 
274 VPDVPSESGSPVYVNOVKVRVSDALLEKKKLMAARERKATKTLGIILGAFIVCWLPFFII 333 
334 SLVMPICKDACWFHLAIFDFFTWLGYLNSLINPIIYTMSNEDFKOAFHKLIRFKCTS 390 


b 
Extended Data Fig. 4 | Modelling quality of the 5-HTjgR structure. residue was not modelled. Regions highlighted in grey represent the 
a, Amino acid sequence of the 5-HT),R construct used for the cryo-EM transmembrane a-helices, and amphipathic helix 8 is highlighted in 
structure determination. Residues are coloured according to how they yellow. b, Model of 5-HT,gR showing the C, positions of amino acid 
have been modelled. Black, good density allows the side chain to be residues with poor density (spheres) and unmodelled regions (dotted 
modelled; red, limited density for the side chain, therefore the side chain lines). 


has been truncated to Cg; blue, no density observed and therefore the 
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Extended Data Fig. 5 | Superposition of donitriptan, adrenaline and 
adenosine bound to their respective receptors. 5-HT gR, 82AR? and 
AjaR! were superimposed (using Pymol) over the whole of the receptor. 
Green, donitriptan; pink, adrenaline; blue, adenosine. 
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Extended Data Fig. 6 | Comparison of the amino acid sequences of the residues in Ga, in which only the main chain atoms make contacts with 
a-subunits of G, and G,. Diamonds above the sequences indicate amino the receptor. Secondary structural elements are indicated as grey bars with 
acid residues in Ga, in which the side chains make atomic contacts to positions numbered according to the CGN numbering system. 

residues in BAR (82 con) or AzaR (2A con). Ovals indicate amino acid 
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Extended Data Fig. 7 | Similarity of Ga structures and the difference coupled to 5-HT),R; green, Ga, coupled to A24R; Ga, coupled to 8,AR. 
poses of the «5-helices in Ga, and Ga, coupled to receptors. a, The b, 5-HT ,R (blue), 82AR? (green) and Az4R! (red) were superimposed 
structures of the «-subunits in complex with 5-HT)gR, 62AR? and Az,R! based on H3, H5 and H6. Two different views are shown with the red 
were superimposed over the whole of their sequence in Pymol. Blue, Gay arrows indicating differences in orientation of Ga, and Gag. 
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Extended Data Fig. 8 | Alignment of the amino acid sequences of Gy acids are highlighted as follows: pink, stabilizing residues required 
and G; «-subunits. Residues in grey correspond to the «-helical region to generate mini-G,; yellow; residues in Gay that are different from 
that does not make contact with GPCRs and was deleted during the residues conserved in all three Ga; sequences; blue, residues that are 
construction of mini-Gy. Secondary structural elements are depicted non-conserved in Gay sequences. #, the affinity tag on mini-G, used for 
as grey bars with the CGN numbers shown to aid comparisons. Amino purification (MGHHHHHHENLYFQG). 
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Extended Data Fig. 9 | Comparison of the «.5-helices of G, and Gy. The sequence and displayed in different poses: cartoon depiction (a); G, (green 
a5 helices in the cryo-EM structures of A7,R-G, (carbon, green) and spheres), Go (blue sticks) (b); G, (blue spheres), G, (green sticks) (c). 
5-HT),R-G, (carbon, light blue) were aligned (in Pymol) along their whole 
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Extended Data Table 1 | Data collection and refinement statistics 


Data collection and processing 


Magnification 
Voltage (kV) 
Electron exposure (e-/A’) 
Defocus range (um) 
Pixel size (A) 
Symmetry imposed 
Initial particle images* (no.) 
Final particle images (no.) 
Map resolution (A) 
FSC threshold 
Map resolution range” (A) 


Refinement 

Initial model used (PDB code) 

Model resolution® (A) 
FSC threshold 

Map sharpening B factor (A*) 

Model composition 
Non-hydrogen atoms 
Protein residues 
Ligands 

B factors (A?) 
Protein 
Ligand 

R.m.s. deviations 
Bond lengths (A) 
Bond angles (°) 

Validation 
MolProbity score 
Clashscore 
Poor rotamers (%) 
EMRinger score 

Ramachandran plot 
Favored (%) 
Allowed (%) 
Disallowed (%) 

@After 2D classification. 


Local resolution range. 


5-HTiz 7 MiniGopy 


(EMDB-4358) 
(PDB 6G79) 


1,249,822 
730,118 
3.78 
0.143 
~3.4 to ~4.6 


5G53, 3SN6 


°Resolution at which FSC between map and model is 0.5. 
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Scientists such as Yue Wan are reaping the rewards of international experience. 


Full circle 


Researchers who return home to East Asia aim to maintain networks and global standing. 


BY VIRGINIA GEWIN 


any junior scientists from East Asia 
M do their degree or post-PhD training 

abroad, but return home to develop 
their careers. The booming economies of 
several East Asian regions offer significant 
opportunities for new principal investigators. 
Nature spoke to four researchers who have 
recently returned to East Asia about the oppor- 
tunities and challenges they face as they launch 
labs, seek talented lab members and forge and 
maintain collaborations. 


HUBS OF ASIAN SCIENCE 


A Nature collection 
nature.com/collections/asianhubs 


YUE WAN 
Be adaptable 


Structural genomicist, Genome 
Institute of Singapore 


When I returned to Singapore in 2013 to 
start my own lab, I was excited to reacquaint 
myself with my Asian culture, especially as it 
becomes less conservative due to globalization. 
I had spent the previous decade in the United 
States, after receiving a scholarship from the 
Singaporean government’s Agency for Science, 
Technology and Research (A*STAR) to develop 
well-trained PhDs to support Singapore’s 


budding biomedical industry. Singapore is 
a small republic that built itself from almost 
nothing, with no natural resources, to a first- 
world city-state that prides itself on excellence, 
perseverance and resilience. 

After earning my bachelor’s degree in 
cell biology at the University of California, 
San Diego, I ended up at Stanford University 
in California for my PhD, where we developed 
a new protocol for conducting genome-wide 
measurements of RNA structure in yeast 
(M. Kertesz et al. Nature 467, 103-107; 2010). 
We followed that work with studies of RNA- 
folding stability in yeast, and on the variation 
in RNA structure across the human transcrip- 
tome, the collection of gene readouts ina cell 
(Y. Wan et al. Nature 505, 706-709; 2014). 

Most A*STAR fellows return as postdocs, > 
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> but in 2013, Singapore launched a fellowship 
programme that offered newly returned PhD 
holders enough money to start their own labs 
straight away. That same year, I became the first 
fellow at the Genomics Institute of Singapore. 
All of a sudden I had to work out how to hire 
people, create a budget and get orders in. It was 
a steep learning curve. 

It was also an oddly isolating position. Most 
of my colleagues, and even I, didn’t know what 
being an independent fellow meant. Postdocs 
didn't hang out with you, and neither did princi- 
pal investigators (PIs). Former mentors advised 
me to make myself visible by speaking up at 
every staff meeting. That is hard, especially as 
a female scientist at an institute with few female 
Pls. But I realized that I had the money and 
independence to do what I wanted, so I did. 

Singaporean research tends to be more 
applied and translational than US science — 
geared towards areas of strategic importance to 
society. One challenge is that priorities for fund- 
ing can shift every 5-10 years to shape economic 
growth, so researchers must be adaptable. 

In 2016, Ireceived an A*STAR investigator- 
ship, which gave me 6 million Singapore dol- 
lars (US$4.4 million) for 6 years and allowed 
me to take a senior-scientist post. I also earned 
several international fellowships, including a 
Branco Weiss Fellowship from the Swiss Fed- 
eral Institute of Technology (ETH) in Zurich, 
a LOreal-UNESCO (United Nations Educa- 
tional, Scientific and Cultural Organization) 
fellowship for women in science and an EMBO 
Young Investigator Award, which helped me to 
collaborate with scientists worldwide. 

I’ve had my lab for five years, and have had 
two children during that time. Singapore cur- 
rently offers four months’ paid maternity leave 
for most citizens, but I didn't take it either time. 
I worried that my lab would tank if I were away 
for four months. I took three weeks off after each 
birth. For the following two months, I worked 
two hours a day to help my staff members to stay 
on course. Then I returned to work full time. 

The Asian family structure is close knit. My 
parents live close by and are happy to take care 
of their grandchildren. Hired help is more 
affordable here than in the United States. With- 
out those support structures — including my 
husband, who splits his time between his labs 
at the Genome Institute of Singapore and Nan- 
yang Technological University — there would 
be no way for me to put in the hours necessary 
to run my lab. 


NETHIA KUMARAN 
Maximize visibility 
Cancer biologist, University of Science 
Malaysia 


Since returning to Malaysia after four years 
in Australia, ’'ve noticed that people are 
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more aware of the importance of science 
and engineering, and that more girls are 
encouraged to pursue science. I first became 
interested in cancer biology as an under- 
graduate in microbiology, when I learned 
that viruses could cause cancer. I was study- 
ing at the University of Science, Malaysia, and 
searched for a mentor, but there weren't many 
experts in that field in Malaysia at the time. 
In 2005, after earning my bachelor’s degree, 
I moved to the United Kingdom for a master’s 
in oncology at the University of Nottingham. 
The Malaysian government offered full schol- 
arships for people 
doing a PhD in or 


: : “Global 
outside Malaysia. gollaborition 
Because there were ; 

fe -biolo 1s not an easy 
so few cancer. gy 
experts in Malaysia, I putes’ = 
went to the Univer- sometimes | 
sity of Sydney, Aus- don’t even get 
tralia, wherelearned “llSwers to 
a PhD in 2012, stud- @-mailsIsend 
ying the cell-death 0 researchers 
pathways of cancer. overseas. 

Thad two excellent 


mentors in Australia. They helped me to bet- 
ter articulate my findings, learn to collaborate 
and to speak in public at international confer- 
ences. I focused on building my network and 
collaborations. 

I wanted to do a postdoc to get more train- 
ing and experience, but Malaysia requested 
that their funded scholars come back as soon 
as we finished our PhDs. It was a big jump to 
being a PI when I chose to return to the Uni- 
versity of Science, Malaysia, in 2013. Located 
on the beautiful island of Penang, the univer- 
sity sits in one of the country’s more culturally 
liberal areas. 

My mentors in Sydney had taught me that 
getting a grant isn't just about the proposed 
research details. Success is also about clearly 
telling funders why this work is important, 
how it fits into the bigger picture and why 
they should give me the money to do it. 
Although that was helpful, I still needed men- 
tors in Malaysia to help me navigate our grant 
system. The culture and academic protocols 
can be very different at home. It was a painful 
learning process, but I secured grants from 
the university and one from the Ministry of 
Higher Education. 

In my five years as a PI, research funding in 
Malaysia has been cut and grants have become 
increasingly competitive, but my lab has two 
PhD students and two master’s students. 
Cancer biology is still a small community in 
Malaysia, although I do have a few collabora- 
tors here. I also apply for international grants, 
but that requires international collaborators. 
And global collaboration is not an easy pro- 
cess — sometimes I don’t even get answers to 
e-mails I send to researchers overseas. 

Right now, I’m establishing my visibility. 
Because I can’t go to international conferences 
every year due to a lack of funding, I organized 


Nethia Kumaran 


an international conference in Malaysia in 
2017. Winning a LOreal-UNESCO fellowship 
in 2016, after applying for four years in a row, 
also really helped to improve my visibility. 

Having said that, I would have to think = 
hard about leaving Malaysia to take a job in 
another country. The jobs here are perma- 
nent, whereas it seems like there are more 
temporary positions abroad. That scares mea 
bit. Ideally, P'll go on sabbatical overseas, pick 
up new skills and come back home. 


ETHIA KUMARAN 


MYUNGEUN SEO 
Stay competitive 


Polymer chemist, Korea Advanced 
Institute of Science and Technology 


In 2012, I co-authored a paper in the journal 
Science with my postdoctoral adviser Marc 
Hillmyer, a chemist at the University of 
Minnesota in Minneapolis (M. Seo et al. 
Science 336, 1422-1425; 2012). It took two 
years to complete, but he gave me the time and 
space to put together a whole story. A two- 
author paper in a high-impact journal is rare 
these days given the pressure to publish, but I 
believe that it helped me to secure my current 
job at Korea Advanced Institute of Science and 
Technology (KAIST), where I also earned my 
master’s and PhD. 

Since my appointment in 2013, ’'m happy to 
be back in the science-oriented city of Daejeon, 
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which has more than 30 national laboratories as 
well as industrial research centres. As a chem- 
istry student at KAIST, I visited laboratories in 
Japan and the United States while researching 
the self-assembly of small molecules into nano- 
structures. In 2007, during my PhD, I spent six 
months as a visiting researcher at the University 
of California, Santa Barbara, where I learnt a 
new type of polymerization — one that I still 
use today, and which has had a huge impact on 
my research path. I published a paper (M. Seo 
et al. Macromolecules 41, 6413-6418; 2008) that 
has been cited 100 times so far. These experi- 
ences encouraged me to apply for several post- 
docs abroad, which is how I ended up in Marc’s 
lab in 2009. 

After four years, I returned to South Korea, 
and the on-site interview for my job at KAIST 
was one of the toughest days of my life. The 
institute competes with other universities for 
scientists who will be highly visible in their 
fields. But the country is experiencing a big 
wave of retirements right now, so there are 
good job opportunities. 

My start-up package let me purchase a 
lab bench, fume hood and some crucial 
instruments to fill my empty space in a 
new building. I started without a postdoc 
or technician because it is difficult to find 
good candidates here. I have 13 students, 4 of 
whom are female, and 1 postdoc, from India. 
KAIST successfully increased the number 

_ of foreign students and foreign and female 
= faculty members to 10% of the total school 
$ population and now aims to increase those 


Myungeun Seo 


numbers by another 10%. I try to give my 
students the time they need to put a research 
story together, as Marc did for me. Three of 
the papers I have published since starting my 
lab have had a student as the only co-author. 

The grant funding situation fluctuates year 
to year — and is highly competitive, with a 
roughly 10% success rate. Despite this, I 
recently learnt that in addition to a personal 
grant to continue my basic research, a team 
that I am part of has won a 7-year grant of 
US$10 million to explore the chemical archi- 
tecture of self-assembled molecules. 

Tl probably apply for tenure in 2020. Since 
KAIST’s graduate school of nanoscience and 
technology launched in 2008, only one person 
has sought, and ultimately achieved, tenure. 
Publishing a number of papers is always 
helpful, but the content is considered more 
important, as is which journal you publish 
in. International visibility is also key to secur- 
ing tenure. In March, with US colleagues, I 
organized symposia at the annual American 
Chemical Society meeting in New Orleans, 
Louisiana, that brought together world leaders 
studying how complex polymers self-assemble. 


KIM HEI-MAN CHOW 
Stay connected 


Neuroscientist at Hong Kong University 
of Science and Technology 


In 2013, Karl Herrup, a neuroscientist at Hong 
Kong University of Science and Technology, 
advertised for someone with expertise in Wnt 
signalling, a crucial pathway that regulates the 
fate of cells. He hired me, even though I was a 
cancer biologist with no background in neuro- 
science, in hope that a fresh perspective would 
prove valuable. 

Id studied Wnt signalling as a molecular- 
biology PhD student at the University of Hong 
Kong. During my third year, I presented my 
research identifying a potential drug target that 
could regulate Wnt signalling at the American 
Association for Cancer Research meeting in 
Washington DC. 

After my presen- 


“The bond is tation, I received two 
stronger when invitations to pur- 
you can talk sue a postdoc in the 
in person and United States. 
discuss current I chose to go to 
research Cornell University in 
together.” Ithaca, New York, to 
work with biomedical 
engineer Xiling Shen 


in 2010, but I had never used a computational- 
biology or systems-biology approach. My 
experience at Cornell rewired my approach to 
biomedical research. 

One reason that clinical trials can fail is 
that we tend to look at only one thing at a 


time. With systems biology, we first look at 
the whole picture and screen whether a gene 
or protein is the major cause of disease. Then 
we use molecular-biology tools to focus the 
research. Six months after arriving at Cornell, 
however, I had to return home to care for my 
sick mother. I couldn't devote myself to full- 
time research, so I worked for a medical editor 
for a year while my mother recovered. 

My first year in Karl's lab was frustrating, 
because I had to learn about the brain from 
scratch, but I decided to seize this chance to 
expand my skills and launch a second career 
applying a systems-biology approach to study- 
ing Alzheimer’s disease. 

In my experience, students and postdocs in 
the United States tend to be freer to explore 
ideas that are unrelated to the PI’s work. In 
Hong Kong, most students do research that 
has already been laid out by their supervisor. 
Both ways have pros and cons. Hong Kong 
attracts a lot of foreign professionals because 
it’s very accessible to cities such as Tokyo, 
Taipei, Seoul and Singapore. It is also a cos- 
mopolitan metropolis where old tradition 
blends with Western culture and post-modern 
trends — perfect for a foodie like me. 

Funding opportunities are perhaps the big- 
gest difference between the United States and 
Hong Kong. In the United States, there are 
federal and non-profit foundations that fund 
small projects or fellowships for postdocs; in 
Hong Kong, there is very little, and most early- 
career scientists are unaware of international 
opportunities. 

I published my first neuroscience paper 
within 12 months of joining Karl’s lab, and 
have since secured grants from the US 
Alzheimer’s Association as well as from the 
Institute for Advanced Study. 

I have been a research assistant professor 
for the past three years. It is a ‘grey’ position 
between postdoc and fully independent assis- 
tant professor. I can apply for grants and con- 
duct my project independently, but I’m still 
affiliated with Karl’s lab. 

I'm also building my profile in the field 
through other fellowships, including one with 
the World Economic Forum, to explore other 
collaborations as I prepare to start my own lab, 
which I hope to do next year. 

To maintain international relationships, I 
make myself available for Skype calls in the 
evenings and on weekends. And, crucially, I see 
colleagues such as Shen in person whenever I 
can. The bond is stronger when you can talk in 
person and discuss current research together. 

My goal is to get a couple of high-impact 
publications before I start looking for my next 
job. Hong Kong is a small place, and space is 
limited owing in part to an influx of research- 
ers from mainland China, so I’m open-minded 
about eventually taking a position elsewhere. m 


INTERVIEWS BY VIRGINIA GEWIN 
These interviews have been edited for clarity and 
length. 
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Ua SCIENCE FICTION 


TRAUMAHEAD 


BY JEREMY SZAL 


when I see him: the warrior with the head 
swollen with nanomemories. 

My skin is raw and bleeding where my 
black CombatSkin has been shredded 
open by human gunfire. I can smell my 
sour wounds. My weapons harness chafes 
my neck and I can’t stop shivering against 
the icy wind. Nothing burns like the cold. 
But self-pity must wait. I grit my teeth and 
trudge over to the fallen warrior. I press my 
eight-fingered hand to the nape of his blood- 
stained neck, where his shattered helmet has 
exposed his bulging nanomemories. The 
nanofibres in my skin tighten, red lattice- 
work standing stark as the warrior’s neck 
peels from his body in luminescent cubes. 
His vibrating nanomemory cells float to my 
outstretched hand, disintegrating on contact 
and merging with my nanofibre matrix. 

It’s like having a bucket of ice-cold water 
drip down my spine as the memories spear 
into my head. We were told to store only the 
highlights. No unnecessary clutter, no memo- 
ries that others might share. I watch years of 
this warrior’s life play out in fragments: his first 
time seeing the sprawling cityscape we built on 
this frozen, tidally locked moon. Meeting his 
wife the night we gathered to watch the solar 
flare. The day he signed up to fight the human 
invaders, after they broke our peace treaty. 

The birth of his daughter. 

My throat is locked tight as I stand. My 
neck and shoulders prickle with sweat, 
scarred flesh expanding as the warrior’s 
nanomemories are rewritten to my bio- 
metrics. Fusing with my body, my con- 
sciousness on a molecular level. But the only 
memories I possess of Asher, my daughter, 
are my own. Her head resting on my chest 
as a child, wanting to listen to her father’s 
heartbeat. Her insistence on joining the 
armed forces when she became old enough. 
Wearing that stubborn, determined look that 
reminds me so much of myself at her age. But 
the memories are grey, distorted. They don't 
have the crystalline clarity of nanomemory 
infusion. They’re inscribed in pencil, not ink. 

I continue along the frozen battlefield 
in search of nanomemories, hoping I will 
find hers. Ash and snow melt on my shoul- 
ders and soak into my CombatSkin. Before 
long, I've scavenged the nanomemories of 
50 of our slaughtered warriors. Childhood 
upbringings, deaths, loves, disappointments. 
Scientific and technological breakthroughs. 
Seeing friends and family blown to twitching 


I *m crunching over the bodies of the fallen 
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Preservation strategy. 


bundles of flesh by human artillery. Brave 
soldiers donning their CombatSkins and 
thermal helmets and departing for the 
battlefield, knowing they will never 
return home. My numb muscles feel 
like flayed meat, my spinal column 
stretching as my body processes the 
constant stream of narratives. 

Some guessed the peace treaty 
between our race and the humans 
would not last. Our existence inter- 
rupted their colonization plans for this 
moon, after all. And if there’s one thing 
we know about humans: they do not like 
to share. Our Jhulivaan civilization pre- 
dates theirs by 15 millennia, but they 
have the numbers, the determina- 
tion, the sheer brute force to destroy 
us. It’s not enough to take our home; 
they want to drive us to extinction. Pre- 
tend that we never existed, that they are the 
only race in the Universe. Despite their small 
physical stature, stout limbs and excessive 
body hair, were so similar in appearance that 
perhaps they took our existence as an insult. 

Only, we were prepared for this. We’ve 
archived our memories, our histories, frag- 
ments of our civilization, in nanomemories 
to be retrieved upon death. But memories are 
based on emotion. Als process data and raw 
statistics. Emotional states are unreadable to 
them. The only thing that can store conscious 
experiences is another conscious being. 

I allowed the nanofibre matrix to be 
installed atop my spinal column. It houses 
these fragmented memories, laced with 
love, hatred, acceptance, rage, fear — and 
the events that led up to them. I rub the fuzz 
along my swollen neck, not quite believing 
that I’m becoming pregnant with the ghosts 
of an entire civilization. 

But no matter how many piles of mutilated 
corpses I sift through, no matter how many 
lifetimes and personalities I transfer into me, 
I cannot find my daughter. I cannot find her 
memories of me and our brief 90 lunar cycles 
together before the war. My weariness van- 
ishes, my chest heaving as I run from body 
to body, hoping I will find her so she can live 
on inside me. The sea of dead faces gazes up 
at me with eyes like bullet holes. But none is 
hers. The howling wind lashes at my maimed 
face. I want to sink to my knees and punch 
the ice until my hands are blocks of bloody, 

frozen flesh. 


> NATURE.COM I swallow, look 
Follow Futures: up. There’s a human 
© @NatureFutures warship in orbit. 
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and I’ve collected all the 
nanomemories I can eas- 
ily store. My stealth-ship 
is waiting to smuggle me 
off-world to the remain- 
ing survivors, where we'll stash our pool 
of nanomemories in a place inaccessible 
to humans. Let the haughty bastards think 
they’ve destroyed all trace of us. And one 
day, maybe a spacefaring explorer will 
uncover our time capsule and share who we 
once were with the Universe. 

Iexhale slowly and take one final glance at 
the smoking remains of the cityscape before 
crunching away over the ice. My nape and 
shoulders have swollen to twice their nor- 
mal size. But there's a gaping hole in there, 
like a wound. I think of my daughter’s face, 
her laughter, her anger, her stubborn bravery 
and outbursts, and all the things that made 
her mine. I know that one day, when I die 
or become senile, the memories will fade to 
smoke in my hands. But until then, as long 
as I can hold her, she'll be with me. 

I press my hand over my heart as I stumble 
away from my home, pretending it’s hers. m 


Jeremy Szal writes about galactic 
nightmares, little traumas and broken 
heroes looking for hope in dark worlds 
from Sydney, Australia. Get in touch at 
jeremyszal.com or @jeremyszal 
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